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ABSTRACT 

We study the clustering of galaxies detected at i < 22.5 in the Science Verification observa¬ 
tions of the Dark Energy Survey (DES). Two-point correlation functions are measured using 
2.3 X 10® galaxies over a contiguous 116 deg^ region in five bins of photometric redshift width 
Az = 0.2 in the range 0.2 < z < 1.2. The impact of photometric redshift errors are assessed 
by comparing results using a template-based photo-z algorithm (BPZ) to a machine-learning 
algorithm (TPZ). A companion paper (Leistedt et al 2015) presents maps of several observa¬ 
tional variables (e.g. seeing, sky brightness) which could modulate the galaxy density. Here we 
characterize and mitigate systematic errors on the measured clustering which arise from these 
observational variables, in addition to others such as Galactic dust and stellar contamination. 
After correcting for systematic effects we measure galaxy bias over a broad range of linear 
scales relative to mass clustering predicted from the Planck ACDM model, finding agreement 
with CEHTLS measurements with of 4.0 (8.7) with 5 degrees of freedom for the TPZ (BPZ) 
redshifts. We test a “linear bias” model, in which the galaxy clustering is a fixed multiple of 
the predicted non-linear dark-matter clustering. The precision of the data allow us to deter¬ 
mine that the linear bias model describes the observed galaxy clustering to 2.5% accuracy down 
to scales at least 4 to 10 times smaller than those on which linear theory is expected to be sufficient. 


Key words: photometric surveys - galaxies clustering - systematic effects - large-scale structure 
of Universe. 


1 INTRODUCTION 

Vast galaxy surveys trace the large-scale structure (LSS) of the Uni¬ 
verse at late times and therefore complement and improve the wealth 
of information already provided by cosmic microwave background 
(CMB) and supernovae experiments. In particular our understand- 
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ing and characterization of the cosmic accelerated expansion. Imag¬ 
ing from multi-band photometry, e.g., SDSS ([York etTT 2000 , 


PanSTARRS (|Kaiser, Tonry & Luppino|2000k KiDS I de Jong et: 
20T3), HSC ( [Miyazaki et al ||2012^ andlhe planned LSST ^Tys' 


et al.|[200^, provides the angular positions and detailed color in¬ 


formation of the galaxies. From this color information, photometric 
redshifts (photo-z) can be measured for each galaxy providing dis¬ 
tance estimates that have typically low resolution. While obtaining 
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detailed radial information requires a spectroscopic redshift survey, 
e.g., 2dF (|Colless et al.|200T) , VVDS (|Le Fevre et al.|2005^ , WiggleZ 
( [Drinkwater et al.|2010^ and BOSS jPawson et al.|2013^ , obtaining 
spectra is more time consuming. Hence the data volume available to 
a photometric redshift sample will naturally far exceed that of a spec¬ 
troscopic one. Many LSS studies have already obtained constraints 
on cosmological parameters from photo- 2 : samples, such as the sum 
of neutrino masses, the matter/energy content of the Universe, and the 
nature of dark energy ( Patoianabhan^e£ay2007| ^Htomas^^Abdalla^ 


|Lahav|2010|[Crocce et al.|2011||Ho et al.|2012||Carnero et al.|2012| 

jde Simoni et al.|2013| l. 

The Dark Energy Survey (DES, |Flaugher| ( |2005^ ) will image 
5000 deg^ of the South Galactic Cap to a depth of i < 24, recording 
300 million galaxies in 5 broadband filters (grizY), thereby provid¬ 
ing high quality photometric redshifts up to redshift 2 = 1.4 jSanchez 
et al.|2014). The DES camera (DECam, Fhmgl^et al.|201^ Diehl 


& and the Dark Energy Survey Collahoration|2012| > was installed and 
commissioned in the second semester of 2012. A Science Verification 
(SV) period of observations followed, lasting from November 2012 to 
February 2013. The DES observations officially started in late August 
2013. 


In this paper, we analyze the clustering of galaxies observed dur¬ 
ing the SV period. The SV region has been observed to match the 
nominal 24th mag depth in the i- band expected for the full DES sur¬ 
vey, and we can therefore use the data to characterize the properties 
of the galaxy samples that DES can reliably observe. We perform de¬ 
tailed systematic tests on the galaxy samples and develop methods to 
mitigate systematic effects on the measured clustering. We asses the 
impact on clustering measurements from uncertainties in photomet¬ 
ric redshift estimation by obtaining our samples using two different 
photo -2 algorithms, one based on a template fitting method and an¬ 
other on a machine learning one. Each of these methods has advan¬ 
tages and disadvantages, which we discuss. Our resulting clustering 
measurements allow us to characterize the evolution of bias as a func¬ 
tion of redshift, to assess the validity of linear bias models and to pro¬ 
vide baseline clustering measurements that other DES studies using 
SV data can use to compare against (e.g., |Giannantonio et al.|2015[ 
jSuchyta et al.|2015) . We were able to complete this work without the 
benefit of a large number of simulated DES SV galaxy samples, but 
future DES LSS studies (covering larger volumes) will rely on such 
simulations. 

The outline of the paper is as follows: we introduce the cluster¬ 
ing estimators and the theory used throughout the paper in Section]^ 
describe our data set in Sections[^and|^ and present the maps of po¬ 
tential systematics in Section]^ In Section|^we discuss the extent and 
mitigation of possible systematics, summarize and discuss our results 
in Section]^ before concluding in Section]^ 


2 ANGULAR CLUSTERING: THEORY AND ESTIMATORS 

In this section we review the modeling of the angular correlation func¬ 
tion and its covariance matrix, in photo -2 bins. We also describe the 
algorithms used to estimate these quantities from the data. 


2.1 Two-point Angular Correlations: Modeling 

Galaxy clustering can be modeled starting from the dark matter over¬ 
density field at the angular position n and at redshift 2 : 5(n, 2 ). If we 
assume a linear bias model, the projected overdensity of our galaxy 
sample is given by, 

= j b{z)^{z)S{h,z)dz. (1) 


where dnjdz is the probability of detecting a galaxy at a given red¬ 
shift (i.e. the normalised redshift distribution) and 6 ( 2 ) is the bias. The 
angular two-point correlation function between redshift bins A and B 
is then defined as, 

wab{0) = {Sg{n)Sg{n + e)} = 

= j dzi(f>A{zi) J dZ2<j)B{Z2)£,{ri2{0),z) ( 2 ) 

where r^2 = ^•( 21 )^ -I- t'( 22 )^ — 2r(zi)r(z2) cos{6) with r(z) being 
the comoving distance to redshift 2 (assuming a flat Universe), and 


,, N D{z) . .dn 


where D{z) is the linear growth factor. Equation ([^ should in princi¬ 
ple be written in terms of a spatial correlation function ^ that evolves 
with redshift. Instead we evaluate the spatial correlation at some mean 
redshift 2 and encode the growth evolution relative to this redshift 
into the selection function tf) (this is exact at the linear level) which 
we make proportional to D{z)/D{z). In turn, rather than assuming a 
parametric function for 6 ( 2 ), we will measure a single bias value at 
each tomographic photo -2 bin. Lastly, the way we estimate the red¬ 
shift distribution of the sample, dn/dz, is described in Sec. and 
shown in Eig.|^ 

We calculate the linear and nonlinear power spectra using 
CAMB (Lewis et al. 2000). For nonlinear dark matter clustering we 
use the recently re-calibrated Halof it prescription jTakahashi et al.j 
[2012^ built into CAMB. We then Fourier transform these into config¬ 
uration space to obtain ^ and evaluate Eq. Even though the effect 
of redshift space distortions is negligible for the angular scales con¬ 
sidered in this paper, we do include them in our predictions (e.g. see 
formulae in | Crocce, Cabre & Gaztafiaga|201l| . 

Throughout the paper we assume a fiducial flat ACDM-l-iz (one 
massive neutrino) cosmological model based on Planck 2013 - 1 - 
WMAP polarisation - 1 - ACT/SPT - 1 - BAG, with parameters jAde et al.| 
|20T4t : cjb = 0.0222, Wc = 0.119, oj^ = 0.00064, h = 0.678, 
r = 0.0952, Us = 0.961 and As = 2.21 • 10“® at a pivot 
scale k = 0.05 Mpc“^ (yielding erg = 0.829 at 2 = 0) where 
h = Ho/100 kms“^ Mpc“^ and uJi = Qih^ for each species i. 


2.2 Two-point Angular Correlations: Estimators 


In this paper we use two different estimators for angular correlation 
functions. One is a direct pair-count algorithm particularly suited to 
investigating small-scale clustering, and another based on pixelized 
maps using HEALPix which is particularly useful to investigate the 
cross-correlation of our galaxy sample with maps of potential system¬ 
atics (such as observing conditions) in an efficient way at the expense 
of decreased angular resolution. 

The i^r-counting algorithm uses the publicly available tree code 
ATHENAljto measure the angular correlation function with the stan- 
dardjLandy & Szalay|(|1993Tl estimator. 


w{e) 


DD - 2DR + RR 


(3) 


where DD, DR and RR represent the number of pairs of objects, 
taken from the galaxy catalogue (D) or from a randomly generated 
sample covering the angular footprint (R), which are separated by 
a scale 6 within a bin size A8. The random points are distributed 
uniformly over the footprint. As we discuss in Sec. |4.3| our angular 
footprint considers only regions of the survey that are deeper than the 
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flux limit in the sample definition, therefore the distribution of random 
points is uniform in regions where the sample is complete. 

The pixel-based estimator for the correlation between maps 1 
and 2 is given by, 


i=JVpix i = iVpi: 


wr2{9)= V V 

’ ^ '' N 1 N 2 


(4) 


where A^pix is the number of pixels used, Ni is the value in pixel 
i of the quantity of interest (either the number of galaxies or, e.g., 
the seeing), N represents the mean over all pixels, and Qij is 1 if 
pixels i, j are separated by a scale 0 within a bin size A6 and is zero 
otherwise. The sum only runs through those pixels that fall within the 
footprint. 


2.3 Covariance Matrix 


In order to estimate the covariance matrix for our w{9) measurements 
we combine two different approaches: a modeling of the off-diagonal 
elements and the jackknife (JK) method for the variance. 

The jackknife technique, which has been widely applied to an¬ 
gular galaxy clustering measurements at small scales (e.g., [Scranton 


et al. 2002[|Myers et al.|2006[[Ross & Brunner|200^ [Coupon et al. 


2012 1 works by splitting the footprint into A^jk small equal-area sec¬ 


tions and recomputing w{9) with each section removed. The covari¬ 
ance matrix between w{9) measurements at two angular separations 
between two redshift bins, A and B, is estimated by. 


^ JK,AB 

Covg. a, 


A^jk -1 

A^jk 


'^Aw"^[9i]Aw^[9j], (5) 




(6) 


where Wk is the correlation function measured with the fc-th JK region 
removed and w is the mean of the Njk jackknife w(0)’s. In our case 
we use 40 JK regions, in such a way that each patch is ~ 2 deg across, 
matching our largest scales of interest. 

Theoretical estimates of the covariance for angular clustering 
have also been explored and tested (e.g. Dodelson 1 2003 1 ; Cabre et ^ 
( [2007^ ; [Crocce, Cabre & Gaztana^ ( 2011^ ; Ross et a . j201 lb 1 ). 


They rely on the assumption that errors scale as the square root of 
the survey area ~ The covariance for a full-sky 

survey is then easily derived considering that angular power spec¬ 
trum measurements Ce are uncorrelated, Covgii = 'Va.T{Ci)Sui with 
Var(C'^) = ‘IC'ij{2£ + 1). One can then use a Legendre transform 
to express this error in real space for and arrive to. 


th 
Covg fl. 


p _ npi I 1 

= 7 — -jy-^Pe{cos9i)Pe{cos9j)iCe + l/nf 

Jsky tazTTJ 


(7) 


for auto-correlations in one bin, and 


1 

CoVfl 


TZ I] 9,)Pe{cos9j) 

Jsky (47r) 


Q 


ab\^ 


+ 


+ 


i/ua) (cr + 


1/ni,)] 


( 8 ) 


for cross-correlations between bins A and B. In Eqs. l [7|8[ l we have 
included the standard shot-noise contribution arising in the variance 
of the Cg estimates (n is the number of objects per steradian). The 
angular power spectrum between bins A and B is given by 


C/® = ^ J dkk^P{k,z)Wt{k)wP(k) 

with the kernel defined for each bin by 

Wg{k) = 




(9) 


( 10 ) 


As before, we evaluate the power spectrum in Eq. 0 at some mean 
redshift and then compute the growth within Wg with respect to this 


reference point. In Eq. 1 10 1 each bin is characterized by its dn / dz and 


h, while jg are the spherical Bessel functions. 

Measurements of the correlation function at different angular 
scales are considerably correlated. The JK estimation is limited by the 
number of independent samples one can build out of the given foot¬ 
print and hence yields noisy off-diagonal correlations. Obtaining good 
estimates of the off-diagonal elements is important, as the correlation 
w(9) between adjacent 9 bins are typically greater than 0.9. For the 
diagonal elements of the covariance matrix, we expect the JK estima¬ 
tion to be adequate and favor it over the theoretical estimation as it is 
extracted from the data itself and hence can trace effects such as non- 
linearities, the effect of the mask, and residual systematic variation. 
The theory estimate on the other hand is adequate for off-diagonal el¬ 
ements because it avoids the intrinsic noise from the limited number 
of realizations. Therefore we combine both approaches by defining 
our covariance matrix as. 


Coveije^ = CorrJf^g.ajK{9i)ajK{9j) ( 11 ) 

where (j‘jK{9i) = Cg^Bi from Eq. l|^ and Corxl^g. is the correlation 
matrix (or reduced covariance matrix) defined by 


Corr 


TH 


Cov 


TH 


<TTH{ 9 i)aTH{ 9 j) 


( 12 ) 


from Eqs. ( [7|8^ . We refer to this as the “mixed” approach. 

The methodology we use to model our covariance matrix is fur¬ 
ther justified by the results presented in a companion paper jGiannan-| 
[tonio et al.[2015[ >, where we present a discussion of various estimators 
for the covariance matrix of auto-correlations. As naively expected, it 
is found that the JK method performs better than theory errors for di¬ 
agonal correlations (i.e. variance) when both are compared to a covari¬ 
ance matrix derived from an N-body simulation. But under-performs 
compared to theory ones for the off-diagonal elements. In Appendix 
l^we show that on large scales both the JK and theory approaches 
yield similar results for the variance, and for derived best-fit biases. 


3 THE DES SCIENCE VERIFICATION (SVA-I) 
PHOTOMETRIC SAMPLE 

3.1 The Science Verification dataset 


The Science Verification (SV) observations, taken over 78 nights 
in 2012 and 2013, provided science-quality data for more than 250 
deg^ at close to the nominal depth of the survey. The SV data foot¬ 
print was chosen to contain areas already covered by several deep 
spectroscopic galaxy surveys, including VVDS dLe Fevre et al.[2005f, 
zCOSMOS ( [Lilly et al.|2007l ), and ACES ( [Cooper et al.|2012) , which 
together provide a calibration sample for the DES photometric red- 
shifts ( [Sanchez et al.[2014l l. In addition to these, two large contiguous 
regions (termed SPT-E and SPT-W, due to their partial coverage of the 
South Pole Telescope ([Carlstrom et al.|2011[| fields) were observed. 


3.2 Image processing: the DESDM pipeline 

The raw images taken each night by DECam are sent for process¬ 
ing to the National Center for Supercomputer Applications (NCSA[^ 
Urbana-Champaign, United States) using the DES Data Management 
(DESDM) pipeline described in [Sevilla et al.[ ( (20TT| l and [Mohr et al.[ 
P012[l. We briefly describe the process below. 


^ http://www.ncsa.illinois.edu/ 
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Figure 1. Footprint of the SVAl Gold catalog. In this analysis, we only use 
data from the largest compact footprint occupying the lower-right, known as 
‘SPT-E’. 


These single-epoch images are ’detrended’ by applying bias and 
flat-fields corrections. Then, cross-talk, pupil, illumination, fringing 
and non-linearity corrections are carried out. Bad pixels are flagged 
and traced in a map while satellite trails, cosmic rays and saturation ef¬ 
fects from bright stars are detected and flagged. The astrometric cali¬ 
bration is performed using SCAMP ^Bertin|2006| l, fitting the best local- 
to-celestial reference system transformation using the single-epoch 
image positions of bright sources. One then relates them to reference 
celestial positions, correcting for distortions from the large field-of- 
view. Nightly absolute calibration is performed using reference stars 
which are observed every night, to obtain a photometric solution for 
that period including the zero point, a color, and airmass terms for 
each CCD. 

The resulting reduced images are then re-mapped into a uniform 
pixel grid in which the coaddition of said images is performed. The 
purpose of this process is to obtain increased depth. 

Lastly, the catalog creation stage uses the SExtractor ( |Bertin| 
|& Arnouts|1996[ ) software , which detects objects on the coadded im¬ 
ages for each band and stores them as unique single object entries 
at a database at NCSA. During this procedure, the coadded astro- 
metric and photometric information is measured and stored as well. 
The astrometry is accurate to 0.2". A global photometric relative cal¬ 
ibration step is performed for this dataset using the exposure overlap 
across different fields, which provides repeated observations of the 
same stars so a minimization procedure can be performed jGlaze-| 
[brook et al.|1994^ . Observations from PreCAM ( jKuehn et al'|2013f 
provide a grid of standards to which this relative calibration can be 
anchored to in order to minimize large scale variation^ Finally, ob¬ 
servations of spectrophotometric standards ties this photometry to the 
absolute AB system ( [Tucker et al.|2007'| l. This final data set represents 
the “coadd” catalog. 


3.3 The Gold catalog 

The coadd catalog produced by DESDM of the SV dataset was ana¬ 
lyzed and tested resulting in the generation of the SVAl Gold catalog 
(Rykoff et al., in prep.) The footprint of the catalog in equatorial co¬ 
ordinates is shown in Fig[^ 

The basic additions of this value added catalog include: 


• Incorporating satellite trail and other artifact information to mask out 
specific areas; 

• Removing areas where the colors are severely affected by stray light 
in the images and areas with a small exposure count (at the borders of 
the footprint); 

• Removing the area below declination of —61° to avoid the increased 
stellar contamination in our galaxy catalog due to the Large Magel¬ 
lanic Cloud (LMC), as well as the area dominated by the nearby star 
R Doradus. 

• Applying an additional Stellar Locus correction jHigh et ^ ( [2009^ ) 
to tighten the calibration even further, ensuring the agreement of stel¬ 
lar colors with respect to those of reference stars. After this correction, 
magnitudes are calibrated to 2 per cent accuracy, with a 1 per cent ac¬ 
curacy in colors (Rykoff et al. in prep.). 

After this selection, about 244 deg^ remain in the catalog. 


3.3.1 Survey mask 


Concurrently with the generation of the catalog, anoth er pipeline 
builds a nominal MANGLEj^mask ( Swanson et al.|[2008t that takes 
into account the DECam CCD pointings and properties of the sky 
during each night. Artifacts such as airplane or satellite trails, cosmic 
rays, etc., and areas near bright stars are masked out. An estimation of 
the depth for the remaining regions is calculated as the magnitude at 
which an object is measured with signal-to-noise of 10 in a 2 arcsecs 
aperture. This aperture-based depth must be converted into a total- 
magnitude depth corresponding to SExtractor’s MAG_AUT0 mea¬ 
surements used in the galaxy selection. This conversion, described in 
Rykoff et al. (in prep.), takes into account properties of the data such 
as the seeing, which affects the relationship between a fixed aperture 
and a total magnitude measurement. The resulting IOct depth map is 
then translated to an averaged weighted pixelized map of resolution 
given by Aside = 4096 (an angular scale of 0.74 arcmin^) using the 
HEALPix software jCorski et al.[[2005) l. These maps will be a key 
component of our final masks (see below). See also the appendix of 
[Suchyta et al.[j2015} for a description of how the mask is constructed. 


4 THE LSS BENCH-MARK SAMPLE 

From the SVAl Gold catalog described above we select a sample of 
galaxies that enables robust clustering measurements by applying fur¬ 
ther magnitude and color cuts and restricting our analysis to the largest 
contiguous area overlapping the SPT-E fielcj^ 

60 < ra [deg] < 95 

—60 < dec [deg] < —40 (13) 

We then focused on an flux limited sample defined by 

18 < i < 22.5, (14) 

where i refers to the SExtractor’s MAG_AUT0 quantity. In Sec. [4.3[ 
we discuss that this sample is complete in regions of the survey deeper 
than i = 22.5 magnitudes. Therefore we will only consider such re¬ 
gions as our baseline footprint, covering an area of 116.2 deg^. Within 
this footprint we have 2, 333, 294 objects in the bench-mark LSS sam¬ 
ple with number density Ug = 5.6 arcmin“^. 


® This calibration is improved using the Stellar Locus correction described 
below. 


^ http://space.mit.edu/'molly/mangle/ 

® Note that we are being very conservative with potential contamination from 
the LMC by removing one extra stripe at —61 < dec [deg] < —60. 
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Figure 2. Star-galaxy separator distribution of tme stars and galaxies vs. mag¬ 
nitude from a combined spectroscopic jSanchez et al.|2014J and space imaging 
sample jLeauthaud et al.|2007^ . Color contour corresponds to the true galaxy 
locus whereas true stars are depicted as dots. Magnitude cuts for the sample 
used in this work are shown as vertical lines, together with the selection of stars 
and galaxies using the WAVG.SPREAD JXIODEL classifier in horizontal lines. 


We also perform the following additional color cuts 

— 1 < g — r < 3 

— 1 < r — i < 2 

-l<i-z<2, (15) 

in order to remove outliers in color space. While MAG_AUT0 mea¬ 
sures galaxy properties independently for each band, MAG_DETMODEL 
applies a consistent morphological model across all bands, yielding 
more accurate galaxy colors. We therefore use MAG_DETM0DEL when 
making the above color cuts on the sample. 


Photo -2 bin 

Abpz 

Vtpz 

0.2 < z < 0.4 

684,416 

441,791 

0.4 < z < 0.6 

759,015 

721,696 

0.6 < z < 0.8 

494,469 

586,510 

0.8 < z < 1.0 

270,077 

361,937 

1.0 < z < 1.2 

55,954 

93,958 


Table 1. The number of galaxies in each of our photo-z bins, when using the 
template based method BPZ (JVgpz) and when using the machine learning 
method TPZ (AiTPz)- 


jects with i < 22.5). The impact of stellar contamination on clustering 
measurements is discussed in Section [63l 


4.2 Photometric redshift estimation and redshift binning 


A priori, different photo-z codes will lead to different estimates of 
galaxy redshifts, which then will propagate into theory predictions 
and eventually into the fitted parameters. There exist two main ap¬ 
proaches for photometric redshift estimation: template fitting methods 
and machine learning ones. Each of these approaches have their ad¬ 
vantages and disadvantages, in particular they depend differently on 
how the spectroscopic set is used, which can be used for training a 
machine learning approach or to derive fitting priors on the template- 
based one. Their differences also depend on the distribution of the 
magnitude errors, the survey depth, the observing conditions and the 
galaxy population among others. 

A direct comparison of clustering observables and derived quan¬ 
tities such as bias between these two approaches is a good way of 
assessing the impact of photometric redshift estimation on cluster¬ 
ing measurements. To our knowledge this test has not been done in 
clustering analyses in the literature. In this paper, we employ two 
different algorithms to estimate photometric redshift: BPZ (Bayesian 
Photometric Redshifts), a well known template-fitting based method 
( |Bemtez|2000[ |Coe et al.|2006) and TPZ jCarrasco Kind & Brunner] 


|2013 2014| l, a high-performing machine learning algorithm for DES 
data. 


4.1 Star-galaxy separation 

The selection of galaxies (as opposed to stars) from the Gold cat¬ 
alog is achieved using a star-galaxy classifier that relies on the 
SExtractor SPREADjyiODEL parameter ( |Desai et al.|2012[|Sou-| 
[magnac et al.|2013l ) which measures the light concentration of each 
object using a linear discriminant analysis at the pixel level. This value 
is calculated for all single epoch images contributing to the final coad¬ 
ded object and the classifier is built using a weighted average expres¬ 
sion of the individual SPREAD JIODEL values. The final value is then 
termed WAVG_SPREAD.MODEL. The star-galaxy separation is done 
selecting objects with, 

WAVG-SPREADJMODEL > 0.003 (16) 

In order to assess the efficacy of this cut, we use DES obser¬ 
vations over the COSMOS field, in which we can make use of the 
measurements from the ACS instrument of the Hubble Space Tele¬ 
scope in that area as a source of ’truth’ information for morphological 
separation of stars and galaxies ( [Leauthaud et al.|2007| >, and also the 
DES observations that overlap with the spectroscopic samples that 
were used in |Sanchez et al.||2014^ (which provide truth information 
based on the spectra). The distribution of WAVG.SPREADjylODEL as 
a function of i-band magnitude for these objects is shown in Fig. 
This data allow us to determine a contamination level of 1.5% from 
stars, with a cut efficiency in the galaxy sample of 96% (for DES ob- 


4.2.1 Template Fitting method 

BP^compares the broad-band galaxy spectral energy distribution to 
a set of galaxy templates until a best fit is obtained, which determines 
both the galaxy spectral type and its photometric redshift. The details 
and capabilities of BPZ on early DES data are presented in |Sanch^ 
|et al.| j2014^ , where it shows the best performance among template- 
based codes. The primary set of templates used contains the|Coleman. 


Wu & Weedman|([l980|l templates, two starburst templates from 


Kin¬ 


ney et al. 1 1996|l and two younger starburst simple stellar population 


templates from Bruzual & Charlot] ( |2003^ , added to BPZ in |Coe et al.| 
( |2006| l. We calibrate the Bayesian prior by fitting the empirical func¬ 
tion n(z,f|mo) proposed in Benitez ^2000 ', using a spectroscopic 
sample matched to DES galaxies and weighted to mimic the photo¬ 
metric properties of the DES-SV sample used in this work. 


4.2.2 Machine Learning method 

TP^ ICarrasco Kind & Brunner 2013 20 14^ is a machine learn¬ 
ing algorithm that uses prediction trees and random forest techniques 


® http://www.stsci.edu/'~dcoe/BPZ/ 

^ http://lcdm.astro.illinois.edu/research/TPZ.html 
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to produce robust photometric redshift probability density functions 
(PDFs) together with ancillary information for a given galaxy sample. 
The prediction tree is built by asking a sequence of questions that re¬ 
cursively split the input data taken from the spectroscopic sample, fre¬ 
quently into two branches, until a terminal leaf is created that meets a 
stopping criterion (e.g., a minimum leaf size or a variance threshold). 
By perturbing the data using their magnitude errors and by taking 
bootstrapping samples, many (600 in this case) uncorrelated trees can 
be created whose results are aggregated to construct each individual 
photo -2 PDF. 

[Sanchez et al.| ( |20T4l l contains detailed information about the ap¬ 
plication of TPZ and BPZ to DES Science Verification data includ¬ 
ing the derivation of photometric redshifts used in this paper. It also 
presents a comparison of these two algorithms to numerous other ex¬ 
isting photo -2 methods. They both proved to be among the best per¬ 
forming codes in the tests presented in that work, motivating their 
use in further science analyses using DES-SV data. The 68th per¬ 
centile widths (corresponding to the scatter in the photo -2 solution) 
were found to be 0.078 for TPZ and 0.097 for BPZ, with 3a outlier 
fractions being 2 per cent for both algorithms, for detailed metrics see 
[Sanchez et al.|j25l4^ . 


4.2.3 Redshift binning 


Because we wish to study the evolution of galaxy clustering with red¬ 
shift, we split our sample in five photo -2 bins of width A 2 = 0.2, 
from Zphot = 0.2 up to Zphot = 1-2. For this analysis we use two pho¬ 
tometric redshift estimation algorithms which provide photo -2 prob¬ 
ability density functions (PDF). One is a standard template-fitting 
based code, BPZ, which we take as our default photo -2 algorithm. 
For robustness and cross-validation we also use a machine learning 
based algorithm, TPZ. Both are discussed in more detail in Sec. [4.2[ 
The number of galaxies in each photo -2 bin is given in Table[T] 

In Fig. we show the estimated “true” redshift distribution cor¬ 
responding to each of these five bins for BPZ and TPZ; we will use 
these in Eq. to make model predictions for the clustering analy¬ 
sis. These were computed by stacking the individual photo -2 PDEs 
of each galaxy in the binned sample. The accuracy of this approach 
in DES-SV data is discussed in [Sanchez et al.[ l 2014^ and its perfor¬ 
mance in simulated data in a companion paper, [Leistedt et ^ ( [2015^ . 

As can be seen by inspecting Table [T] there are some large dif¬ 
ferences in the numbers of galaxies within the same Zphot bin but 
using different photo -2 algorithms. These differences can be under¬ 
stood when also considering the reconstructed n{z) of each sample. 
For instance, in the 0.2 < Zphot < 0.4 bin, the distribution is con¬ 
siderably wider for BPZ. Thus, the bin has more objects in it, as ob¬ 
jects that are truly at higher redshift (and occupy a larger volume) are 
mis-estimated to occupy the low redshift bin. Therefore, despite their 
differences, we expect each set of galaxies to effectively probe the 
mean clustering properties of the galaxies at the effective redshift of 
the sample, when properly analyzed within the context of the dn/dz. 


4.3 Survey depth and baseline angular mask 

Within any given redshift range, minimization of the expected un¬ 
certainty on clustering measurements is a trade-off between area and 
number of objects (see, e.g., [Crocce, Cabre & Gaztanaga|[201 1[ ). A 
fainter flux-limited sample maximizes the number density of observed 
galaxies, but one should discard areas where this limit is not met in 
order to secure the completeness of the sample. The available SV area 
decreases by approximately 20 deg^ for limiting magnitudes in the 
range 20 < MAG_AUT0 < 22.5 but exhibits a sharp decrease of deeper 




z 


Figure 3. Redshift distributions obtained from stacking the photo -2 PDF of 
objects selected in five top-hat photo -2 bins using either the template based 
estimation (BPZ, top panel) or the machine learning one (TPZ, bottom panel). 


regions. For example, from the approximately 150 deg^ SV area over¬ 
lapping SPT-E region, only about 80 deg^ have a limiting magnitude 
of i = 23. 

Thus, we set the flux-limit of i = 22.5 for the definition of the 
sample and consider only the regions with i-band limiting magnitude 
MAG_AUT0 > 22.5 in our clustering analysis p] The resulting angu¬ 
lar footprint, after combining with the survey mask of Sec. [3.3.1[ is 
shown in Fig.[^ It occupies a contiguous area of 116.2 deg^. We con¬ 
sider this footprint as our baseline mask which will be used for all 
subsequent clustering measurements in this work. But we note that 
further masking may be due in some photo -2 bins to mitigate system¬ 
atic effects, this will be the subject of Sec.|^ 

We also note that our area and sample are very close to the ones 
used in the CHFTLS analysis of [Coupon et al.[ j2012^ ; with an area 
of 133 deg^ to a limiting depth of i = 22.5. However our footprint is 
contiguous, which makes the impact of the integral constraint on large 
scales negligible as opposed to measurements in the four separated 
fields of CFHTLS. The sample we analyze is the largest contiguous 
area demonstrated to have a reliable i-band depth of 22.5 for extra- 


® We have checked that the mean number density of objects in the sample 
does not change any longer if we restrict to deeper areas. 
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Figure 4. The distribution of LSS bench-mark sample galaxies over the angular footprint defined by regions with survey limiting magnitude in the i-band > 22.5. 
The sample is selected to be flux limited to i ^ 22.5 and has a mean density of 5.6 arcmin~^. All the regions considered provide at least SjN 10 measurements 
for objects at i-band = 22.5. This choice balances concerns between using the maximum depth and area possible, as described in the text. The x-axis (y-axis) 
corresponds to right ascension (declination) measured in degrees. 


galactic sources, despite being less than 3% of the size of the final 
DES footprint. 


5 POTENTIAL SOURCES OE SYSTEMATICS AND MAP 
PROJECTIONS 

Quantities that may modulate the detection efficiency of galaxies and 
produce spurious galaxy correlations have been recorded and mapped 
so that any systematic effects can be empirically studied. The maps 
used in this paper are presented below. 

Galaxy catalogs can be affected by the time-dependent fluctu¬ 
ations in the observing conditions of the survey. These fluctuations 
affect the depth in non-trivial ways because they occur in the single¬ 
epoch images, which are coadded and post-processed before extract¬ 
ing the galaxy properties. In other words, the transfer function be¬ 
tween the raw images and final catalogs is a complicated function of 
the input single-epoch images and the coadding and source-extraction 
pipelines, also coupled to the galaxy density field and astrophysical 
foregrounds. 

A significant effort is dedicated in DES to understanding the 
transfer function from intrinsic to observed quantities (see, e.g., the 
BCC-UEig framework, Chang et al.||2014] and also the Balrog 
framework, [Suchyta et al.||2015| , which will be critical to precisely 
identify sources of systematics and eliminate them in the cosmologi¬ 
cal analyses. Such techniques can in principle be used to account for 
the observational effects that cause fluctuations in the observed galaxy 
density. These methods are still maturing (though see |Suchyta et al.| 
|2015|for a working example of applying such techniques to LSS mea¬ 


surements), however, and thus for the DES-SV data we instead test the 
observed galaxy density against a large number of maps of potential 
sources of systematics (e.g., the mean seeing). These maps are then 
used to run null-tests and correct for systematic shifts in the cluster¬ 
ing measurements, as detailed in subsequent sections. This requires 
one to project the properties of the single-epoch images onto the sky, 
accounting for the geometrical overlap and also weighting due to the 
coadding procedure. This projection is fully detailed in |Leistedt et al.| 
( |2015| l, where the full set of maps, their potential applications, and 
a pedagogical example using catalogs extracted from the BCC-UEig 
simulated images are presented. 

For this paper, we consider single-epoch properties that are 
known to affect the depth and therefore might affect the clustering 
measurements. Each of these observational properties is mapped on 
the sky into high-resolution (Nside = 4096) HEALPix format maps, 
and we reduce the full set of values in each pixel into a weighted 
mean using inverse-variance weights (as in the coadd and mangle 
pipelines). The four quantities used in this work, computed in the 
gn'zKbands, are (see |Leistedt et al.|2015| for a full description): 

• FWHM: the mean seeing (in pixel units), measured as the full width 
at half maximum of the flux profile. 

• a i rma s s : the mean airmass. 

• skybrite: the mean sky brightness. 

• s k y s i gma : the mean sky bac^round noise derived as flux variance 
per amplifier in each CCD chiplj 


® In principle, the sky brightness and the mean sky background should be 
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Figure 5. Maps of potential sources of systematic errors that can lead to spatial variations in the number of observed galaxies (through depth fluctuations or sample 
contamination) or degrade the data quality itself (impacting the determination of magnitudes, colors or photo-z). The values at each pixel are computed as single 
statistical estimators of the single-epoch values for the images contributing to that map pixel. 


We will consider only the mean quantities; in the future it might 
be important to test against maps of the variance of these properties 
as well. In addition to the above maps, we consider the following: 

• Galactic dust extinction: As described above, the data 
are calibrated using the technique of stellar locus regression, which in 
addition to correcting for instrumental and atmospheric effects also 
removes reddening due to Galactic extinction. To test for residual 
effects of Galactic dust on the photometry, we include the|Schlegel,| 

IFinkbeiner & Davis| ( |1998| l dust map as a potential systematic, 
pixelized with HEALPix resolution Nside=4096. 

• USNO: Contamination of our galaxy sample by stars will affect the 
measured clustering; stellar density increases towards the galactic 
plane, following a gradient that causes significant clustering signal 
on large scales ( [Crocce et al.|201l1|Ross et al.|201 la[|Ho et al.|2012^ . 
Furthermore, because the footprint used in this work neighbors the 
Large Magellanic Cloud, stellar contamination may introduce more 
complicated spurious clustering. To investigate this effect, we include 
a map of stellar density across the field, as measured by the USNO-B1 
catalog ( [Monet et al.|2003^ . We take USNO stars with B magnitude 
brighter than 20; deeper than this limit the depth of the USNO 
catalog varies across the field. Although this catalog is brighter 
than our sample, we expect the stars in our sample to trace the same 


strongly correlated. We test against both as a validation that this process has 
worked properly. 


galactic density distribution as those in the USNO catalog. Due to low 
statistics we use a coarser map HEALPix resolution of Nside= 256[3 

• depthjnask: As described in section [43] we cut our sample at a 
magnitude of MAG_AUT0 < 22.5 and use a footprint where the data 
provides > IOct measurements at that depth. We include the depth 
map as a potential systematic in order to probe possible inaccuracy of 
the map and incompleteness at the faint end of the sample. 

• chi2_psf_fit: The coadded image on which we base our 
photometry can have complicated structure, as the number of input 
images varies across the field and each input image has a unique PSF. 
It is therefore difficult for the photometric reduction to characterize 
the image and its PSF across an entire coadd unit. Poor characteri¬ 
zation of the coadd image quality at the location of a star will cause 
the star to be poorly fit by the PSF model. In locations of a poor PSF 
fit to stars, galaxy photometry will also be adversely affected by the 
inaccuracy of the estimate of the object’s size. We therefore use a 
map of the average of objects’ fits to the PSF model to test against 
this potential source of systematic uncertainty. We only consider very 
bright objects, with 16 < i < 18, to ensure that the large majority of 
them are stars. 


We also tested results against a sample of fainter DES detected stars and 
found no significant differences. 
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• chi2_detmodel_fit: MAG_DETMODEL magnitudes are mea¬ 
sured by fitting a Sersic profile, convolved with the PSF model, to the 
object image. Poor MAG_DETMODEL fits over a region of data would 
indicate photometric systematic errors in the same manner as poor 
PSF fits, as described above. We therefore also test our data against a 
map of the average ')d of the detmodel magnitude fits; we expect 
this map to be somewhat degenerate with the chi2_psf _f it map. 

We show some of the most relevant maps in Fig.|^ 


6 MITIGATING SYSTEMATIC EFFECTS IN ANGULAR 
CLUSTERING 

In this section we give a detailed explanation of our general procedure 
to address the impact of potential systematic effects due to varying 
observing conditions, stellar contamination, or any other source of 
spurious density fluctuations across our footprint. We assume that all 
potential sources of systematics are encoded in the maps described in 
Section|5] 

Let us first describe our step-by-step procedure in broad terms 
with more details and results given in subsequent sections, 

[1] Galaxy Density vi. Potential Systematics : Our first step 
is to study the galaxy number density in each tomographic bin as 
a function of the value of each potential systematic variable. If the 
galaxy density is independent of the value of the given potential 
systematic we do not consider it as impacting our data. Otherwise 
we try to either mask it or correct for its impact in the clustering 
measurement, as shown below. 

[2] Bad Regions Masking : The relation between galaxy density 
vs. potential systematics can take several forms. If this relation is 
such that the galaxy density is constant as function of the potential 
systematic and changes sharply after some threshold value, we mask 
the regions of such anomalous dependence (such as bad seeing, or 
high airmass) to minimize its effect. This defines a veto mask that we 
then use in all subsequent clustering analysis. The quality of the data 
is such that we typically mask only small fractions of the footprint. 
We will only mask data worse than a given quantity, as when the data 
quality is worse, we naturally expect photo-zs, star/galaxy separation, 
object detection, etc., to all perform worse and thus be more likely to 
cause spurious fluctuations in the observed galaxy density. 

[3] Clustering corrections using Cross-correlations : If the 
relation between galaxy density vs. potential systematic is smooth 
and close to linear (after imposing the veto mask discussed above) we 
use cross-correlation between galaxies maps and the observational 
maps to correct our measurements. 

[4] Stellar Contamination : As described in Section |4.1| we 
estimate 1.5% of the SV galaxy sample is comprised of mis-classified 
stars. The clustering of stars over the SV footprint is close to zero, 
implying that the main effect of the contamination is to proportionally 
remove power from the w{9) measurements. We estimate the stellar 
contamination in each tomographic bin and derive how to correct 
our clustering measurements given such contamination in Section[63] 


6.1 Galaxy Density vs. Potential Systematics 

Our first step is to study how the galaxy density depends on each 
potential systematic variable, if at all. The starting point for these 
measurements is pixelized maps of the distribution of galaxies and 


of the potential systematics. We divide the range of the values of each 
potential systematic variable into roughly ten bins. For the sky area 
corresponding to each bin, we calculate the mean galaxy number den¬ 
sity normalized by the mean across the whole footprint. In Fig. we 
show the resulting normalized galaxy density as a function of the dif¬ 
ferent potential systematic variables. After examining all 2 bins, we 
only show the relevant cases. Error bars are computed using the jack¬ 
knife method by splitting the footprint into 100 regions. Hence er¬ 
rors in Fig. might be under-estimated due to a limited sampling of 
large-scale fluctuations. Furthermore, note that extreme values of the 
potential systematics are typically not sampled in most JK regions. In 
such cases (i.e. the first and/or last points in each panel of Fig.|^ we 
choose to estimate error bars as Poisson sampling noise. 

We perform this procedure in every redshift bin, and define an 
angular mask due to systematics per bin. These are: 

• 0.2 < 2 : < 0.4: The top left panel of Fig. shows the relation 
of galaxies selected in this bin with BPZ as a function of FWHM 
in ^-band and 2 -band. For high values of seeing we see a clear 
drop of galaxy density and therefore choose to remove these areas. 
We cut regions with g-band FWHM > 1.34 arcsecs (1.09 of the 
mean g-band FWHM) which removes 13% of the area. Similarly, 
we remove area with 2 -band FWHM > 1.24 arcsecs (1.12 of the 
mean 2 -band FWHM) which removes a further 6%. In all 19% of 
our nominal footprint is masked for the BPZ sample at this photo -2 
bin. In the top left and right panels of Fig. we show histograms of 
FWHM shading the portion removed by this selection (at the same 
time. Fig. shows the maps for FWHM in all bands, providing an 
idea of which regions are removed). The same procedure but using 
galaxies selected with TPZ yields the top right panel of Fig.|^ In this 
case we remove i-band FWHM > 1.28 arcsecs (1.11 of the mean 
i-band FWHM). Note that there is also a dependence with 2 -band 
FWHM. However after the regions with worst i-band FWHM are 
removed, this dependence disappears. This is shown by the triangular 
symbols with dashed error-bars . Hence for TPZ we only mask bad 
i-band FWHM regions removing ~ 11% of the original footprint. 
The distribution of i-band FWHM is shown in the bottom right panel 
of Fig.|^ where the masked out part has been shaded. 

• 0.4 < 2 < 0.6: For this bin only the g-band FWHM affects the 
galaxy density in a way suitable for masking. It does so in a very 
similar manner for both photo -2 samples, as shown in the bottom left 
panel of Fig. Hence we define a common veto mask for BPZ and 
TPZ removing regions with g-band FWHM > 1.34 arcsecs (1.09 of 
the mean gr-band FWHM). This removes 13% of the area. According 
to Fig. |7] the threshold value could be smaller because the relation 
turns around at ~ 1.28 arcsecs. We have checked that this extra 
masking does not change the final clustering results, but it implies 
removing about 30% of the area. Hence we cut g-band FWHM 
> 1.34 arcsecs what should account for the bulk of the effect. Note 
that after removing this region there is a clear dependence of galaxy 
density increasing with y-band FWHM. This residual is exactly the 
type that can be addressed by means of cross-correlations between 
galaxies and potential systematics, as detailed in Sec. |6.2| 

• 1.0 < 2 < 1.2: For the highest redshift bin we again define one 
additional mask for BPZ and one for TPZ based on Fig. bottom 
right panel. For BPZ we remove regions with r-band FWHM > 1.28 
arcsecs (1.08 of the mean r—band FWHM) which amounts to 6% of 
the original footprint (leaving 109.2 deg^). For TPZ we cut regions 
with i-band FWHM > 1.28 arcsec (1.1 of the mean i-band FWHM 
and 10% of the area). Above these threshold values the density of 
galaxies drops sharply compared to the mean. 

For the bins at intermediate redshifts (0.6 < 2 < 0.8 and 
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Figure 6. Galaxy density (normalized to the mean over the footrpint) as a function of the value of different potential sources of systematic errors, in regions where 
those galaxies reside. This is displayed for our two photo-z estimators and different tomographic bins. We only show those cases where the density of galaxies drops 
steadily after some threshold value indicating regions that may lead to systematic effects. To minimize them we mask these regions on top of our nominal mask (as a 
function of photo-z bin). Values masked out in each case are indicated by the inset boxes in blue/red. Cases where this relation is roughly linear (e.g. 0.4 < z < 0.6 
after high p-band FWHM values are masked) induce a change to the clustering measurements that can he corrected for using the cross-correlations between the 
maps for galaxy distribution and the ones for systematic effects (see Sec.|6.2^ 


0.8 < 2 < 1 . 0 ) we find no dependence of the data with the poten¬ 
tial systematics. Thus, we do not correct the clustering or introduce 
additional cuts to the nominal footprint when analyzing these bins. 


6.2 Galaxy-Systematics cross correlations 

Once regions with bad observing conditions that translate into an 
anomalous drop of the density of galaxies have been identified and 
masked out we devote our attention to correcting more subtle de¬ 
pendencies where the density varies roughly proportionally with the 
strength of the systematic. For this we follow and extend the work 
of |Ross et al.| ( [^lla^ ; |Ho et al.| j2012[ t. We start by assuming that 
fluctuations in the potential sources of systematic effects Jgys will in¬ 
duce spurious fluctuations into the observed galaxy number density 
(^obs = ngjug, such that 




^syst / (^.syst) 


^sy.st / (^syst) 


^obs — (^true H" ^ ^ OLi Jg- 


Figure 7. Distribution of FWHM weighted co-added values in different bands 
across our footprint. The shaded parts of the histograms correspond to regions 
where large decrement of galaxy density in different photo -2 bins is found. To 
avoid a systematic effect they were removed from the con'esponding clustering 
analysis. See Sec.|6.1|for further details. 


where Jtrue are the true underlying fluctuations we want to recover 
and i refers to any of the maps described in Section]^ By definition 
we assume that true galaxy fluctuations do not correlate with the ones 
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induced by the systematicj^ 


((^obs (^sys) ■ ^ ^ (^sys^sys)- 

i 

(18) 


If we define, 



‘Across,j — (^obs ^sys)? 



'^autOjij = (*^sys ^sys)5 

(19) 

S' 

¥ 

then Eq. ([Tslt can be written as, 


X 


o 

U)cToss,j '— ^ ^ 'Wsbuto,ij ■j 

( 20 ) 

o 


i 


and the solution is a matrix inversion, 

Cr = tUcross ' (tUauto) • (21) 

Hence if we have 6 systematic maps we have to invert a 6 x 6 matrix 
for each 6-b'm (notice that we are a priori assuming that a is spatially 
independent to factor it out of the correlations). From Eq. (TTJ one 
can work out the auto correlation of true fluctuations 
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Figure 8. Different two-point con'elations entering the clustering con'ection in 
the bin 0.2 < z < 0.4 for the BPZ sample due to spatial variations in i-band 
FWHM over the DBS footprint. See Sec. |6.2| for details. 


which reduces to the standard case, 


m(0)true — tn(^)obs tUcross/tUauto (24) 

for just one systematic. 

Note that our starting assumption in Eq. ( |17[ of a roughly linear 
relation between (5obs and Says can be tested and confirmed with the 
one-dimensional relation of galaxy density as a function of potential 
systematics as done in Sec. |6.1| If strong non-linear dependencies are 
found then Eq. l |17[ will not apply and the corrections induced will 
bias the measurements. For this reason, it is important to mask the 
regions (e.g., high ^-band seeing) where the relationships are most 
clearly non-linear, as is done in the previous section. 

In addition, note that our approach does account for the correla¬ 
tion among the potential systematics themselves. Nevertheless, solv¬ 
ing for an arbitrarily large number of maps might still induce over¬ 
corrections and biases due to the inversion in Eq. \23\ and the noise 
of the measurements particularly on large-scales. 

Further below we explain our quantitative criteria based on cross¬ 
correlations to choose which maps impact the galaxy sample and need 
to be corrected for. Then we follow a combined approach in which we 
also cross check against the density relations as in Sec. |6.1| for those 
systematics that pass the criteria that a linear relation exists. 

To use a quantitative criterion to select the most relevant sys¬ 
tematics maps, we cross-correlate them with the galaxy distribution 
maps at each redshift bin after applying the masking described in Sec¬ 
tion | 6 T] For each potential systematic we calculate the correction to 
the galaxy correlation according to Eq. Waross/'Wauto, ignoring 
cross-correlations among systematics to begin with. This correction 
is determined for each angular bin 6. Using JK resampling over the 
footprint, we calculate an error on the correction. Then, if the correc¬ 
tion is inconsistent with zero at a Ict levej^ the systematic is deemed 

This means we must be careful in choosing systematics: e.g. photomet¬ 
ric redshift error is typically smallest for luminous red galaxies, which cluster 
strongly; we therefore must not use a map of redshift error (or, similarly, mag¬ 
nitude error which correlates with galaxy luminosity) as a systematic. Other¬ 
wise the correlation can be strong, see for examplelMartf et al.|j2014|. 

We use 10 logarithmically distributed angular bins in U.12 — 4 deg. The 
1(T limit corresponds to > 11.53, where the x^ is computed using the 
covariance of the estimates in the JK regions. 


significant and taken into account in further analysis; otherwise, it is 
neglected. 

Figure shows as an example the correlation between the cen¬ 
tral redshift bin, 0.2 < 2 < 0.4 for the BPZ sample and the FWHM 
map in the i band. The figure shows the auto-correlation of the galaxy 
sample, the auto-correlation of the FWHM map, the cross-correlation 
between the data and the systematic, and the corresponding correc¬ 
tion that should be applied to the data (ignoring covariance with any 
other systematic effects), see Eq. ( |24[ . The correction is significantly 
non-zero, and therefore we consider the i-band FWHM map to be a 
relevant systematic in our analysis. 

For the BPZ photo -2 catalog the systematics that correlate sig¬ 
nificantly with the data set are: 

• 0.2 < 2 < 0.4 ; i-band r-band and 2 -band FWHM, r-band Skybrite 
and dust extinction (5 maps) 

• 0.4 < 2 < 0.6 : g-band and r-band FWHM (2 maps) 

• 0.6 < 2 < 0.8 : None 

• 0.8 < 2 < 1.0 : None 

• 1.0 < 2 < 1.2 : None after r-band FWHM masking 

For the TPZ photo -2 catalog the systematics that correlate sig¬ 
nificantly with the data set are: 

• 0.2 < 2 < 0.4 : r-band FWHM, r-band and 2 -band Skybrite (3 
maps) 

• 0.4 < 2 < 0.6 : r-band FWHM (1 map) 

• 0.6 < 2 < 0.8 : None 

• 0.8 < 2 < 1.0 : None 

• 1.0 < 2 < 1.2 : None after i-band FWHM masking 

Despite the fact that the limiting depth is the same in all of the 
redshift bins we use, we find slight variations in the type and degree 
of systematic contamination as a function of redshift. The corrections 
arise mainly at low redshifts, hence they might be due to a correlation 
between the observing conditions and the determination of photomet¬ 
ric redshifts (the DES filter systems does not contain M-band, which 
degrades the low redshift photo- 2 ). Nonetheless, the significance of 
the corrections is never beyond 2a at any given angular scale which 
signals that the data is not very impacted by systematics and our re- 
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Figure 9. The purity of the DES galaxy sample in the COSMOS field in each 
of the BPZ tomographic redshift bins we use, determined using the default cut 
on WAVG.SPREAD.MODEL > 0.003. The stellar contamination is at most 2%. 


suits not dominated by these corrections. This is discussed in more 
detail in Sec.|7]and Tahles|^and[3] 


6.3 Stellar Contamination 


Stellar contamination will affect the measured clustering signal, even 
if the cross-correlation between the galaxies and stars is negligible. 
This is due to the fact that rather than modulate the selection function 
of galaxies, stellar contamination introduces a separate population. In 
the limit where the stars are un-clustered and the stellar contamination 
is constant the observed galaxy density is, = (1 — fstar)Sgai and 
thus 


'W^gal — 


Wo 

(1 - /star)2 ’ 


(25) 


i.e., the measured clustering is diluted by the square of the purity of 
the galaxy sample. 

We derive a general expression for the clustering of a galaxy pop¬ 
ulation, given some stellar contamination (full details are shown in 
Appendix [D]) 


tUgaZ — (1 “b /star) ( tUo /startnstar,S 


r4 

J star 


(1 + /star)^ 


(26) 


where in this formalism, the number density of contaminating stars is 
allowed to be a function of not just the total density of stars, but also 
the survey observing conditions. 

The cross-correlation tests presented in previous sections did not 
find stellar contamination to be a significant systematic contaminant. 
This suggests that the term in Eq. |26| proportional to tUatar.s is con¬ 
sistent with zero. However, the terms (1 -|- /star)^mo — fsta.T remain, 
and thus any stellar contamination must be accounted for. 

We estimate the stellar contamination in each photometric red- 
shift bin by using DES observations in the COSMOS field. This re¬ 
gion contains space-based observations which identify matching DES 
sources as point-like or not (as detailed in Section [4T| l. Figure|^shows 
the purity of the DES sample in the COSMOS field and its uncertainty, 
that we estimate in each redshift bin for the classifier discussed in 
Sec. |4.1| The uncertainty includes the statistical Poisson variance, the 
error due to sample variance fluctuations of the galaxy sample, which 
impacts the purity too, and the fluctuations of the stellar sample in the 
SPTE area. These are large due to the presence of the Large Magel¬ 
lanic Cloud, and are included when extrapolating to the SPTE field. 
This extrapolation to the DES SPTE area is done by finding the den¬ 
sity of highly probable stars in the COSMOS field with al9<i<21 


cut and |WAVG_SPREADjyiODEL)| < 0.001 and we find that the av¬ 
erage density is the same as that found in the DES SPTE area when 
applying the same cuts, to within 10% (below our error bars due to 
intrinsic fluctuations of the galaxy and stellar samples). We apply the 
correction given by Eq. 1 26 ' to the 'w{9) measurements and propagate 
this uncertainty into measurements of the galaxy bias. The correction 
to the bias is largest in the 0.6 < 2 < 0.8 bin, but is at most 2%. 


7 RESULTS 

We now discuss different aspects of our angular clustering measure¬ 
ments in the tomographic bins, in particular the corrections for sys- 
tematics discussed above. Let us first note that in what follows we 
define the best-fit bias b and Icr error at, in the standard way, i.e., 
given 

wt(^))Cov“^(0, 6'){w{ 9') — b^ wt{9')) 
e,e' 

(27) 

we first find its minimum to define b and then vary it around this 
value such that Ax^ = 1 with Ax^ = + <^b) — The 

sum runs through a range of scales that will be specified accordingly 
while the covariance has been defined in Sec. |2.3| as the “mixed” ap¬ 
proach, see Eq. (n}. In Appendix [B] we discuss the best-fit bias re¬ 
sults when different ways of estimating the covariance are considered. 
Note that we do not perform joint fits including multiple redshift bins, 
hence Eq. l |27^ does not include the covariance between different red¬ 
shift bins. The b^ wt refers to the modeling of Eq. (|^, where wt 
is the underlying dark-matter clustering multiplied by a linear bias b 
to yield the model galaxy-galaxy correlation function. If wt is com¬ 
puted starting from the linear theory power spectrum, we call this 
model “linear growth”. If instead this is computed with the nonlin¬ 
ear prescription Halof it jTakahashi et al.|2012[ ), but still maintain¬ 
ing a linear bias, we refer to it as the “linear bias” model. On large 
scales Halofit reduces to linear theory, while on small scales the 
effects of non-linear growth increase the model correlation above its 
linear value. The smallest scale at which wt from Halofit equals 
Wt from linear theory will be referred to as “scale of linear growth”. 
These are reported in the caption of Tablej^for each of the five photo- 
2 bins. 

Tables|^and|^show the result of fitting a “linear growth” model 
to the clustering measurements before any systematic correction is 
applied (second column, labeled “baseline mask”), after a subsequent 
masking of regions with the worst observing conditions, as discussed 
in Sec. |6.1 [ (third column, labeled “-I- bad regions masking”) and lastly, 
after the remaining correlation with relevant maps is corrected for as 
detailed in Sec. |6.2| (fourth column, “Gal-Syst cross correlations”). 
Table|^corresponds to the BPZ photo -2 catalog, while Tablej^to the 
TPZ one. 

The corrections for systematic effects lower the clustering am¬ 
plitudes and hence the derived best-fit biases. They are mainly im¬ 
portant for low redshift, in particular the 0.2 < 2 < 0.4 bin. Here 
the BPZ best-fit bias goes down by Sb/ai, ~ 3, while for TPZ the 
change is Sb/at ~ 1.6. For the 0.4 < z < 0.6 bin the change in 
bias is 0(ab). These two redshift ranges are important for the com¬ 
bination of galaxy-galaxy clustering with weak lensing observables, 
which typically use lens samples at low- 2 . Hence these corrections are 
likely to be necessary to produce an unbiased cosmological analysis 
using such a combination of probes, e.g., a spuriously large clustering 
amplitude could translate to a spuriously high measurement of erg. At 
higher redshifts, which we expect to be more suitable for future BAO 
studies due to larger volumes and better photo -2 performance, we do 
not find significant observational systematic biases affecting our mea- 
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Figure 10. Angular auto-correlation functions, w(9), in photo- 2 : bins of the BPZ estimator for our flux limited sample (i < 22.5) spanning a broad range in redshift, 
from 2 : ~ 0.2 to 1.2. Blue circles correspond to the measurements after correcting our sample for systematics as discussed in Sec.[^ while red squares is before such 
corrections (only shown when relevant). Solid lines correspond to a linear bias model applied to the non-linear matter w{6) computed with Halof it. The best-flt 
bias displayed was obtained from fitting the range of scales shown in each case (main panels). Dashed lines correspond to linear theory, with the same value of the 
bias. Note how the simple “linear bias” model describes the clustering towards scales considerably smaller than the linear regime shown by dashed lines. The inset 
panels show the performance of the best-fit “linear bias” model towards smaller scales than the ones used in the fit (main panels), see text for a detailed discussion. 


BPZ (template method): best-fit bias and \<j error 


Photo-z 

Bin 

Baseline 

Mask 

- 1 - Bad Area 
Masking 

+ Gal-Syst 
Cross-Corr 

X^l 

d.o.f. 

0.2 < z < 0.4 

1.28 ±0.07 

1.20 ±0.07 

1.05 ±0.07 

2.5/7 

0.4 < 2 < 0.6 

1.25 ±0.05 

1.26 ±0.05 

1.23 ±0.05 

8.3/8 

0.6 < 2 < 0.8 

1.35 ±0.04 

- 

- 

2.3/9 

0.8 < 2 < 1.0 

1.54 ± 0.02 

- 

- 

10.3/10 

1.0 < 2 < 1.2 

2.20 ± 0.07 

2.17 ± 0.09 

- 

3.6/10 


Table 2. Impact of the different connections for obseiwational systematic ef¬ 
fects on the derived best-fit bias b for the tomographic bins selected with our 
template method. The baseline mask is described in Sec. |4.3| and corresponds 
to all regions where our sample is complete (i.e. lOcr depth i >= 22.5). 
The third column corresponds to an additional masking of regions with high 
values of potential systematic variables such as seeing, where we observe 
large decrements in galaxy density (as described in Sec. |6.1) . The fourth col¬ 
umn refers to further corrections to w(6) after these masks, in cases where 
the data still correlates with maps for potential systematics (as discussed in 
Sec. |6.2} . The fifth column reports the x^/d-o.f after all corrections ap¬ 
plied. Empty entries refer to cases where such corrections were not necessary. 
These values were obtained after fitting the “linear growth” model for scales 
9 > 9min = (0.26,0.18, 0.12, 0.08, 0.06) deg., from first to last 2 : -bin. 

surements, at least for the range of angular scales probed in with our 
current data. 

Figure |T^ shows the angular correlation function measurements 


TPZ (machine learning method) 


Photo-z 

Bin 

Baseline 

Mask 

-H Bad Area 
Masking 

+ Gal-Syst 
Cross-Corr 

X^/ 

d.o.f. 

0.2 < 2 < 0.4 

1.18 ±0.07 

1.13 ±0.08 

1.07 ± 0.08 

2.1/7 

0.4 < 2 < 0.6 

1.29 ±0.04 

1.30 ±0.04 

1.24 ± 0.04 

6.7/8 

0.6 < 2 < 0.8 

1.34 ±0.05 

- 

- 

14.5/9 

0.8 < 2 < 1.0 

1.56 ±0.03 

- 

- 

3.7/10 

1.0 < 2 < 1.2 

1.97 ±0.09 

1.96 ±0.06 

- 

4.5/10 


Table 3. Same as Table[^but for the tomographic bins selected with the ma¬ 
chine learning method. Different algorithms for photometric redshift estima¬ 
tion use different data quantities (most notably, template based ones use mag¬ 
nitude errors while most machine learning do not). Thus one expects a different 
response to potential systematics. 

in the five consecutive photo-.^ bins in which we have split our sample 
(for the BPZ sample). Blue circles show our measurements after ap¬ 
plying all of the corrections for observational systematic effects that 
we have detailed, while the square red symbols show the measure¬ 
ments before any correction. As is clear from Fig.Q^these corrections 
do not affect the small scale clustering, and are most important at low 
redshifts. The dashed curve in each panel corresponds to a linear the¬ 
ory model (“linear growth”), while the solid curve uses the non-linear 
Halofit prescription (“linear bias”). Each model curve is for the 
bias given in Table[^ fit over the range of angular scales indicated in 
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Figure 11. Comparison of the large-scale bias measured in a DES-SV flux limited sample (i < 22.5) to equivalent measurements from CFHTLS derived from 
Coupon et al. (2012). We present DES results for two different photometric redshift catalogs, one obtained using a template method (BPZ), another with a machine 
learning approach (TPZ). The overall agreement between the two DES samples as a function of redshift is better that 2 per cent for z < 1. At 2 > 1 is difference is 
not statistically significant (~ 2a). This represents a non-trivial test for DES-SV photometric redshift estimation. Our results are also in good agreement with those 
from CFHTLS, with x^/d.o./ = 4/5 for TPZ and 8.7/5 for BPZ, representing a cross-validation of data quality and sample selection. 


its the caption. In Sec. |7.3| we discuss the angular range of validity of 
a linear bias prescription based on Halof it, therefore in Fig.|10|we 
have chosen to display the clustering measurements to scales that go 
beyond the linear regime. 

There is good qualitative agreement between the linear bias 
model and the observed clustering at the scales shown in the main 
panels of Fig. [T^ (we discuss this more in quantitative terms below 
in Sec. |7.3^ . This result is interesting because it implies that, at least 
for projected clustering in angular coordinates, the scale of non-linear 
biasing is different from the one of non-linear dark matter clustering. 
The latter is currently better understood, so in general terms this result 
is relevant. We will come back to it in Sec. 17.31 


We further note that at large scales {9 > 2°) all the correlations 
tend to zero. This is a signal that systematic effects are under control, 
as systematic variations tend to introduce spurious large-scale power 
( |Ross et al.|2011a||Ho et al.|2012[[Leistedt et al.|2013^ ). 

In the inset panels of Fig. [T^ we show the clustering measure¬ 
ments at smaller angular scales and using a log scaling for w{9). 
The model curves are merely extrapolations using the best-fit bias 
recovered on larger-scales (main panels). Qualitatively, this simple 
model does not depart strongly from the measurements. The data at 
these scales will be used to constrain the halo occupation distribution 
of DES SV galaxies in Sobreira et al. (in prep.). However, the gen¬ 
eral agreement signals that a more elaborate non-linear bias prescrip¬ 
tion based on perturbation theory (Fry & Gaztanaga|1993[|McDonald| 


|2006[ jChan, Scoccimarro & Sheth||2012[ l might be adequate to de¬ 

scribe the data in this high signal-to-noise regime. Such study is left 
for future work. Instead, in Sec. |7.3| we will show that the current data 
size and quality of DES-SV is able to distinguish the breakdown of a 
“linear bias” prescription in detail. 


Lastly we note that we have chosen not to show the clustering 
for the TPZ sample in a manner analogous to Eig. [T^ as the results 
look almost identical at the qualitative level. 


7.1 Bias Evolution and Comparison to CFHTLS 

The DES science verification data used in this paper (and in a series of 
papers accompanying it) is in several regards similar to that collected 
by the CFHTLS collaboration, for example in depth, photometry and 
area. CFHTLS has, to this point in time, been regarded as the state of 
the art for deep wide area photometric data. Hence in this section we 
compare our clustering measurements to those presented by [Coupon] 

lerai ( |M^ . 

Galaxies in jCoupon et al.| ( |2012| l are selected according to SEX- 
tractor MAG_AUT0 magnitudes of i < 22.5, and thus comprise a 
very similar sample to the one presented in this paper. To facilitate 
the comparison further we have chosen our photo -2 bins similarly to 
those in the clustering study of |Coupon et al.| ( f2012[ >. Although pho¬ 
tometric redshifts in CFHTLS were estimated using a different tem¬ 
plate method, LePhare jllbert et^p006) l, [Coupon et al.| ( |2009^ ) we 
have used a very similar set of templates to obtain our DES BPZ data 
( [Sanchez et al.|20T4| >. The comparison to a neural network photo -2 
catalog such as TPZ is novel in this regard. 

The apparent magnitude sample i < 22.5 in [Coupon et ar]p012| l 
was then split and reported as several volume limited absolute mag¬ 
nitude “threshold” samples in each photo -2 bin. Therefore in each 
redshift bin, our sample (also selected by i < 22.5) corresponds to 
the faintest absolute magnitude sample in |Coupon et al.|p012^ , except 
for the fact that [Coupon et al.[ ( |2012| l imposed cuts to make the sample 
volume limited, where we have not. Eor example, for 0.4 < 2 < 0.6 
there is a group of galaxies brighter than i = 22.5 but fainter than 
Mr — 51og/i = —18.8 (see their Pig. 4). These galaxies are in our 
sample but not in the Mr — 5 logj^Q h < —18.8 sample of [Coupon] 
|et al.| ( |2012[ (. In Appendix |C| we estimate the impact of these differ¬ 
ences in selection when comparing b{z) results. We find the differ¬ 
ences are most relevant at high 2 , where the galaxies in the samples 
have luminosity thresholds, L, such that db/dL is large. 

The bias evolution reported by[Coupon et al.[(2012|l, accounting 
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Figure 12. Full set of two-point cross-correlation functions between all of the 
BPZ redshift bins we use. The theory curves represent a Planck fiducial model, 
with linear bias values fitted to the auto-con'elations displayed in Fig. |lQ| over 
scales 9 > 0.2 deg. Cases where this produces a match to the observed cross¬ 
correlations demonstrates the robustness of the dnidz estimation. 


for the differences in sample selection as described in Appendix 
is shown by a filled light-red region in Fig. |TT] (where the width cor¬ 
responds to the statistical errors bars reported by CFHTLS) 0 The 
green squares with error bars correspond to our bias measurements 
for the BPZ sample presented in the previous section and in Table 
1^ while the blue squares correspond to our TPZ sample as reported 
in Table (both for the cases where all corrections for observational 
systematics have been applied). 

Overall, the match in the recovered bias as a function of redshift 
between our apparent magnitude sample and the corresponding one 
by CFHTLS is good, with a slight tension at the highest bin for BPZ. 
Note however that for CHFTLS the error bars depicted in Fig. [m are 
statistical only (as reported by [Coupon et al.| \2012) ), while in our 
case we include effects such as the reduction of effective area due 
to further masking of bad observing regions, or the impact of stellar 
contamination. If we define 

E (^DES - &cfhtls)V(4^V (28) 

bin j = l 

and add the errors in quadrature, CTj = cfhtls we 

find that x^{BPZ) = 8.7 while y^iTPZ) = 4 for five degrees 
of freedom in each case. Assuming that the DBS and CFHTLS data 
are drawn from the same underlying Gaussian distribution, greater yd 
would be expected in 55 per cent of cases for the TPZ comparison 
and 12 per cent of cases for the BPZ comparison. This means that the 
agreement is better than Icr for TPZ and better than 2a for BPZ. 

We stress that the agreement for the bias evolution between the 
two photo -2 catalogs is non-obvious because different photo -2 codes 


We have re-scaled the CFHTLS measurements that assumed at 2 = 0 
erg = 0.8 to our cosmology erg = 0.83 


® 25 
^ 20 

15 

n‘ 10 
V 5 

d 0 


Photo-z: TPZ (machine learning) 

Theory: best fit to auto-correlation 

0 

0 20 

V 

t 1® 
N*" 10 

1x3 

fttrrmT-* 

2x3 



2 20 

V 

i 15 
10 

V 5 

CD 

1x4 

2x4 

3 x 4 







CM 25 
20 
V 

t 1® 
n‘ 10 

^ 5 

q 

1x5 

2x5 

3x5 

4x5 

V... 

-5 



I * 1 i 5 5 i . ■ * 



0.1 1.0 0.1 1.0 0.1 1.0 0.1 1.0 
0.2 < < 0.4 0.4 < < 0.6 ^^^^0.^6 < < 0.6 0.8 < z,„ < 1.0 


Figure 13. Same as Fig.|12|for the photometric catalog derived with TPZ. 


could, in principle, select different types of galaxies in different red¬ 
shift bins (note that the algorithms implemented in BPZ and TPZ are 
very different). This might explain the difference at the last 2 -bin if 
BPZ were selecting a rarer population with a higher bias. However 
we note that if the two samples were independent, the disagreement 
would be 2.\a, and thus not remarkable. For 2 < 1.0, the two tech¬ 
niques return samples with bias values that agree to better than 2 per 
cent, suggesting that at these redshifts, systematic uncertainties asso¬ 
ciated with the photo- 2 S in this redshift range are well-controlled. 

The increase in bias with redshift found in Fig.[TT]is primarily 
a consequence of the effective luminosity threshold, as this threshold 
naturally increases with redshift for a flux limited sample. Only the 
high luminosity galaxies will be observed at high 2 , and they are more 
biased. This effect accentuates for 2 > 0.8 as the majority of galaxies 
come from the exponential tail of the luminosity function. The 6 ( 2 ) 
shown in Fig.[^is therefore not physical in the sense that the sample 
is changing with redshift. In Appendix [A| we show that this evolution 
of galaxy bias as a function of redshift is also recovered in simulations 
based on Halo Occupation Distribution techniques to populate halos 
with galaxies. 

7.2 Consistency from cross-correlations of photo -2 bins 

As a further test of potential systematics and photo -2 we show in 
Figs. [T^ and the full set of cross-correlation measurements be¬ 
tween photo -2 bins for the BPZ and TPZ samples, respectively (see 
IChristodoulou et ^ ( [2012^ for a related analysis using SDSS data). 
These measurements were done using a cumulative mask, for each 
photo -2 estimator, combining the ones derived for each photo -2 bin 
in Sec. 16.II 

For bins that are well-separated in redshift, we observe that their 
cross-correlations are consistent with zero, as expected from the red¬ 
shift distributions in Fig. One notable exception is the measured 
cross-correlation between bins 0.2 < 2 < 0.4 and 1.0 < 2 < 1.2 for 
BPZ, which is significantly negative. Understanding the cause of this 
is left for future work. 
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Solid lines in Figs. 1 1 2| and 1 1 3| correspond to our fiducial model, 
given by Eq. for Wij provided with the redshift distributions in 
photo -2 bin i and j. They use a bias b = y/bibj where bi and bj are 
the the best fit “linear growth” model bias from the auto-correlations. 
This is therefore a fully predictive model given the auto-correlations, 
and a test for outliers and the reconstructed dn/dz. 

In Tables and js we compare the predicted y^bibj with bias 
values determined by directly fitting the cross-correlation functions 
over the same range of scales (6 > 0.2 deg), which we call bt n We 
estimate the error on the bias prediction y/bjhj by a direct propagation 
of the error on the auto-correlation biases, (Ji,. and , 

^ (Vbibj/2){abjbi + Ohjbj). (29) 

The fourth column displays the absolute difference between \Jbihj 
and bij divided by their error combined in quadrature. In the 20 total 
cases tested (considering both photo -2 methods), we find only one 
case where Ab/ub is greater than 2.21<j (BPZ bin 3x5) and only two 
cases where they are greater than \A5a (the additional case is TPZ 
bin 1X 2). Considered in total, the cross-correlation results agree with 
the auto-correlation results at roughly the expected statistical level. 

The picture recovered from Figs.[^and[T^and Tables|^and|^is 
that outliers are not causing obvious inconsistencies in our analysis, 
and that to a large extent our estimated redshift distributions are cor¬ 
rect. However, future DES data will require more work to assess the 
level of systematic uncertainty associated with photo —2 estimation. 


7.3 On the Scale of Linear Bias 

In Sec.j^and Eig.[T^we showed that on large angular scales the clus¬ 
tering recovered in our data agrees well with the standard linear theory 
prescription: 6 ^ iodm, Lin(0) for all photo -2 bins. We further noted 
that at smaller scales the assumption of linear bias appeared to still 
hold provided the description of the underlying dark matter cluster¬ 
ing accounted for non-linear growth of structure (for which we used 
Halof it from |Takahashi et al.|2012| ). In this section we explore in 
quantitative terms the scale at which this assumption leads to a signif¬ 
icant data-theory discrepanc}p^ 

We define the transition from linear to non-linear dark mat¬ 
ter clustering as the minimum scale such that iudm, Haioflt = 
WDM.Lin. Given our photo -2 bins this occurs at ^Linear Growth ~ 

(0.26,0.16,0.11,0.09,0.08) deg. for bins one to five. This corre¬ 
sponds to roughly 4Mpc h~^ of comoving separation across redshift 
given our cosmological model. The best-fit biases and on linear 
scales (0 > ^Linear Growth) are provided in Table|^ 

To state at what point the “linear bias” model becomes notably 
worse than the “linear growth” model we perform the following test. 
We first find the using the data vector and covariance matrix for the 
angular bins where 6 > 6'Linear Growth and the “linear growth” model. 
Then we add new data to the fit by decreasing the minimum angular 
scale considered, Omin < ^Linear Growth, and using a “linear bias” 
model we determine the difference in x^ values between the extended 
data vector and the “linear growth” one. A large Ay^ (depending on 
the extra data-points, A(d.o.f.)) implies that a model with a single 
bias parameter is no longer a good description of the extended data 
vector. In quantitative terms, we define 6^111111 such that, 

Ax^ - A(d.o.f) = 4 (30) 


Note that we use the same range of scales here for all redshift bins, so that 
a fair compaiison is possible for all cross-correlations. The biases thus differ 
slightly than those that would be inferred from Table|^ 

In this section, we focus on the BPZ data for concreteness but the conclu¬ 
sions reached in this section do not significantly depend on this choice. 


BPZ cross-correlations 

Bin i X j 

bij 

\/bibj 

Ab/ab 

1x2 

0.98 ±0.07 

1.10 ±0.05 

1.45 

1x3 

0.00 ± 1.23 

1.12 ± 0.05 

0.91 

1x4 

0.01 ± 4.86 

1.19 ±0.05 

0.24 

1x5 

0.00 ±3.99 

1.44 ±0.07 

0.36 

2x3 

1.25 ±0.14 

1.26 ±0.05 

0.04 

2x4 

0.00 ± 1.64 

1.33 ±0.05 

0.81 

2x5 

0.04 ± 5.74 

1.61 ± 0.08 

0.27 

3x4 

1.22 ±0.13 

1.36 ±0.06 

0.97 

3x5 

3.27 ± 0.43 

1.65 ± 0.08 

3.74 

4x5 

1.96 ±0.12 

1.75 ± 0.08 

1.39 


Table 4. Compaiison of best-fit biases bij obtained from fitting the two-point 
cross-coiTelation functions between photo -2 bins i and j (displayed in Fig. |12^ 
vs. the prediction for such signal from the best-fit to auto-correlations in 2 -bin i 
and 2 -bin j, given by yjbibj. The fourth column shows the absolute difference 
Ab = \bij — yjbibj I divided by their eiTor combined in quadrature. 



TPZ cross-correlations 


Bin i X j 

bij \/^i 

Ab/ab 


1x2 

1.35 

± 

0.08 

1.13 

± 

0.05 

2.21 

1x3 

0.00 

± 

1.91 

1.12 

± 

0.05 

0.58 

1x4 

0.00 

± 

1.79 

1.24 

± 

0.06 

0.69 

1x5 

0.00 

± 

3.28 

1.36 

± 

0.07 

0.41 

2x3 

1.01 

± 

0.21 

1.25 

± 

0.05 

1.11 

2x4 

0.00 

± 

1.40 

1.39 

± 

0.06 

0.99 

2x5 

0.00 

± 

2.35 

1.53 

± 

0.08 

0.64 

3x4 

1.11 

± 

0.20 

1.37 

± 

0.06 

1.27 

3x5 

0.00 

± 

2.14 

1.51 

± 

0.07 

0.70 

4x5 

1.61 

± 

0.15 

1.68 

± 

0.08 

0.39 


Table 5. Same as Table but for the TPZ cross correlations displayed in 
Fig.[T3] 


as the point where a 2a preference for an improved (but unknown) 
model for the bias as a function of scale exists. We thus define the 
largest angular scale at which this condition is met as our measure¬ 
ment of ^LinearBias. In what follows, we refer to the results of this 
test as the results of the ‘x^ test’. 

The results of the y^ test are shown in Fig. 14 for all 2 -bins. We 
find 6 

Linear Bias ~ (0.06,0.05, 0.04,0.037, 0.007) deg. for bins one 
to five. We consider the accuracy at which we detect the breakdown 
of the “linear bias” scheme as half^the precision in the measurement 
of the correlation function at ^LinearBias. For our current SV sample 
this is ~ 2.5% for the central photo -2 bins, and ~ 4% for bins one 
and five. 

In Fig.[T^(top panel) we plot the resulting ^Linear Bias as a func¬ 
tion of redshift (solid purple curve) and compare it to ^Linear Growth 
(dashed orange curve). The scale where the “linear bias” model fails 
is approximately three times smaller than the scale where non-linear 
growth becomes important, except for the 1.0 < 2 < 1.2 bin, where 


Because w{9) oc 
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Figure 14. We estimate the smallest scale flmin at which a linear bias model 
is still a good description of the clustering by computing the difference 
between a fit extending to and a fit only on large linear scales. A Ax^ ~ 
4 + A(dof) roughly corresponds to 2cr evidence that the linear bias model 
fails to describe the data. 


we find a factor of 10. The bottom panel of Fig.[^shows the same in¬ 
formation but translated to co-moving separations at the centre of each 
redshift bin, assuming our fiducial cosmological model. The scale of 
linear growth is roughly constant with redshift, ~ 4 Mpc, while 
the scale of linear bias is ~ lh~^ Mpc (except in the last bin). 

In Fig[^we present, as a cross-check, the resulting best-fit bias 
as a function of the Omin tested when obtaining the test results. 
The dashed line in each panel shows the “linear growth” bias from 
Tables and 1 and 2. As we extend the fit down to smaller Smin we 
recover smaller error-bars, as expected. Further, the measured bias 
values remain consistent with the dashed line, until approximately the 
scale of Linear Bias determined using the x^ test. We find that in all 
redshift bins, the measured bias begins to be clearly under-estimated 
compared to the asymptotic large-scale value (this would be consis¬ 
tent with a negative 2nd-order bias parameter in the local Eulerian 
bias model) at close to the 6min determined using the test. 


0.25 

0.20 


“=0 rv 1 c 

U 0.15 

I 0.10 

0.05 

0.00 


- scale of linear growth 

- scale of linear bias 



0.4 0.6 0.8 1.0 



Figure 15. Top Panel: the evolution of the scale down to which the linear 
bias model (with non-lineai' dark matter) reproduces our angular clustering 
measurements, compared to the evolution of the linear bias + linear dark matter 
clustering. The former is valid to considerably smaller scales. Bottom Panel: 
Same information but translated into co-moving distances at the given redshift. 


These results are interesting because they imply that forthcom¬ 
ing DES studies might be able to incorporate higher signal-to-noise 
data into cosmological analyses that use the full shape of the correla¬ 
tion function information with a simple bias model. We do stress that 
a more thorough analysis would involve allowing for a more complex 
model with non-linear and/or non-local bias terms and then investi¬ 
gate how the best-fit value for the linear order term in such scheme 
compares to the linear model results we present. This is however be¬ 
yond the scope of the present paper and we postpone it to future work. 


8 CONCLUSIONS 

We present the first large-scale clustering analysis of a galaxy sample 
selected from the Science Verification Data of the Dark Energy Sur¬ 
vey. The sample is selected as an apparent magnitude limited sample 
i < 22.5, with no color selection except for very conservative cuts 
to remove color outliers. The sample extends from 2phot = 0.2 to 
Zphot = 1.2. We performed our analysis in five tomographic bins of 
width Azphot = 0.2. 

This paper has three main foci: 

(i) We perform a detailed analysis and amelioration of potential 
observational systematic effects. We use a set of maps for different 
variables that can modulate galaxy detection efficiency, which 
include varying observing conditions, stellar density, and Galactic 
dust. We analyze which of them are affecting our sample by means 
of angular cross correlations and a measurement of galaxy density 
as a function of the value of the potential systematic variable across 
the footprint. We then apply cuts to minimize these systematic 
correlations, with a small loss of statistical power. We then show 
how these two approaches complement and validate each other. 
These methods can be widely applied to future clustering studies, in- 
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Figure 16. Best fit bias as a function of the minimum angular scale included in 
the fit. The inset label “Linear Growth” indicates the range of scales where a 
model with linear bias and linear matter clustering (hence “linear growth”) 
applies in our data. The “Linear Bias” label corresponds to a model using 
nonlinear matter clustering (Halof it by Takahashi et al. (2012)) but keeping 
the bias linear. 


eluding our detailed accounting of the effects of stellar contamination. 

(ii) We evaluate the clustering in the five redshift bins comparing 
it against simple linear theory models. We find good agreement with 
these models and between the results we obtain from two separate 
photo -2 methods (one template based, the other machine learning) of 
determining DES photometric redshifts. Both sets of measurements 
are also consistent with bias measurements, determined at the same 
redshifts for the same i < 22.5 flux limit, obtained by the CHFTLS 
survey. We measure the cross-correlation between these redshift bins 
as a test of the photo -2 reconstruction, finding the cross-correlation 
matching predictions from the autocorrelation signal at the expected 
statistical level. 

(iii) We explore the regime of validity of our simple models that 


involve either linear / non-linear dark matter clustering, and a linear 
bias term. We find that, in angular clustering, the scale at which a lin¬ 
ear bias model is not consistent with our data is considerably smaller 
than the scale at which non-linear growth in the clustering of dark 
matter (as predicted by Halofit 2012) becomes important com¬ 
pared to the linear theory predictions. These results are relevant for 
probes that aim to combine weak lensing with large scale structure, as 
they suggest a linear bias model could be sufficient to scales as small 
as l/i“^Mpc. 

The data we analyze is approximately l/30th the size of the of 
final DES data se|^ In the near future the methods presented here 
will be used and extended to larger DES samples suitable to measure 
the Baryon Acoustic Oscillation scale, constrain the shape of the mat¬ 
ter power spectrum and measure the growth of structure. As part of 
this future analysis, we will use additional tools to robustly assess sta¬ 
tistical and systematic uncertainties. We will make greater use of the 
Balrog tool presented in |Suchyta et al.| ( |2015^ , which will enable 
both a larger data sample and a more comprehensive assessment of 
systematic variations. Most importantly, we will use detailed mock 
catalogues and simulated data to provide robust and accurate covari¬ 
ance matrix estimates. These simulations will also be used to assess 
the systematic uncertainty introduced by the various systematic miti¬ 
gation techniques and uncertainties in the photo- 2 s. Further, we will 
consider the impact of time-dependent variations in the data quality 
comprising the co-add. The work presented in this paper will help 
inform the creation of these simulations and we will continue to re¬ 
fine and apply the tools we have presented throughout this paper. In 
this sense, the methods and results we present represents a first step 
towards the goal of constraining cosmology with DES ESS measure¬ 
ments. 
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Figure 17. Top panel shows the evolution of galaxy bias with redshift in a flux 
limited sample (i < 22.5) similar to the one defined in this paper but selected 
from our MICECATv2.0 mock galaxy catalog. We display bias derived from 
galaxy auto-correlations = \/Wgglwmm and galaxy-matter cross corre¬ 
lation bx = Wgm/wmm- The evolution is similar to the one in the DES-SV 
data. Bottom panel shows the ratio of the two estimates, i.e. the stochasticity 
coefficient r = byijbx, which we found close to unity in the redshift range 
relevant for DES data. 

metric and Polarimetric Standardization, Sterken C., ed., p. 187 
Tyson J. A., Wittman D. M., Hennawi J. R, Spergel D. N., 2003, 
Nuclear Physics B Proceedings Supplements, 124, 21 
York D. G. et al., 2000, AJ, 120, 1579 


redshifts to avoid redshift space distortions, which can have a large 
effect when using narrow redshift bins as in Fig.|17|(|Eriksen & Gaz-| 
|tanaga |2014^ but are negligible (given scales and errors) for Az = 0.2 
bins used in this paper ([Crocce, Cabre & Gaztanag a|2011[[Ross et al.| 
|2011b| l. The MICECATv2.0 bias is fitted from the ratio in the range 
0.2 < 0 < 2.0 deg. The solid line is the bias measured from cross- 
correlating galaxies and mass directly, bx = Wgmiwmm- The shaded 
region tries to reproduce the expected sampling and shot-noise errors 
for DES SV in 6 a. It corresponds to A6 a = {Awgg/wmm)/{‘2bA), 
where matter is kept fixed and the error in the galaxy-galaxy correla¬ 
tion Awgg is scaled to 116 deg^ and Az ~ 0.2, from the scatter in 
24 JK regions over a larger 30x30 deg^ simulation patch. The smaller 
error bars include sample variance cancelation between matter and 
galaxy fluctuations (i.e. the scatter in bx from different regions). The 
dotted line shows a cubic fit in simulations to bx and errors. The evo¬ 
lution with redshift is very similar to the one in the ESS bench-mark 
sample, in particular the fast rise at high redshift. AXz < 0.5 DES-SV 
data seems to be Icr lower than simulations. Larger areas are needed to 
decide if this is a significant discrepancy. The bottom panel of Fig.|17| 
shows the stochasticity coefficient: r = Wgm/yJWggWmm = bx/bA- 
The error bars are scaled from the dispersion in r from JK regions: 
thus, they include sample variance cancelation and correspond to the 
30x30 deg. patch used for this test. There seems to be some significant 
detection of r at the smallest redshifts, but this is most likely a reflec¬ 
tion of the poor performance of JK errors for the smallest volumes. 
At z > 0.3 we find r to be in the range r ~ 0.98 — 1.00, which is 
consistent with unity when errors are scaled to the SV area. This is 
relevant for analyses using probes that cross correlate the bench-mark 
galaxy sample defined here with other data sets such as CMB lensing 
( [Giannantonio et al.|2015^ or weak lensing maps. 

Note that the bias is displayed in units of (crg/O.SS) to account 
for the difference in the MICE value of ag = 0.80 and the one used in 
this paper (a — 0.83). There is quite a good agreement in the values 
of b{z) without need of any other adjustment. This illustrates the fact 
that the steep rise of bias at 2 > 1 is in fact expected using simple 
HOD and luminosity evolution models. 


APPENDIX 

A Testing the bias evolution against MICE bench-sample 


Here we test the evolution of bias for our flux-limited sample us¬ 
ing the MICECATv2.0, simulation which is an updated version from 
the MICECATvl.O catalog (see Eosalba et al.H2015a| (, |Crocce et al.j 
( |2013t , [Eosalba et al.j j2015b[ (, [Carretero et al'| ( |2015| () extended to 
lower mass halos and therefore less luminous galaxies. This version is 
complete to i ~ 24 at all redshifts. Magnitudes are assigned to galax¬ 
ies following a combination of Halo Occupation Distribution (HOD) 
and subhalo abundance matching (SHAM) prescriptions tuned to fit 
the local (2 ~ 0) galaxy luminosity function and clustering as a func¬ 
tion of luminosity and color (see [Carretero et ar] ( |2015) ). To account 
for evolution in galaxy luminosity on top of the one induced by halo 
mass evolution, we transform the galaxy magnitudes using: 


ievoi = i — 0.8[afan(1.52) — 0.1489], 


(31) 


which gives similar counts to those in COSMOS and other galaxy 
surveys (Castander et al, in prep.). We then restrict the sample to 
ievoi < 22.5 to compare to the DES SV bench-mark sample. 

The top panel of Eig.^] shows a comparison of the linear bias 
measured in MICECATv2.0 against the one in our BPZ and TPZ sam¬ 
ples shown in Eig. M In the case of MICECATv2.0 we estimate the 
bias as 6 a = ^/wggIwmm, where Wgg is the measured galaxy angu¬ 
lar correlation function and Wmm the corresponding measured matter- 
matter correlation in the same redshift bin. This is done using true 


B Statistical error estimation 

In Sec. |2.3| we introduced our approach to estimate the covariance 
matrix between angular clustering measurements, see Eq. ijTTJ. This 
combined a theory modeling for the off-diagonal elements with a 
Jackknife (JK) approach for the variance. The JK approach is intrinsi¬ 
cally limited by the fact that typically (given the size of our footprint 
and data) is not possible to extract many statistically independent sam¬ 
ples to construct a reliable full covariance. However it does account 
better than theory estimations for issues such as small scale mask ef¬ 
fects, boundaries, non-linerities and/or contamination or systematics 
in the data. On the other hand, the theory estimation yields off diag¬ 
onal elements that have no noise compared to their JK counter-parts. 
In this appendix we explore this choice in more detail. 

A first test is to compare the error estimates on w{6) measure¬ 
ments (i.e. the variance) that we obtain using the JK approach with 
the theory ones. Eor the theory estimate we used the best-fit bias over 
linear-scales, the exact footprint size and the exact shot-noise mea¬ 
sured in the data (i.e. number density of objects in the photo -2 bin 
after masking). The JK one employs 40 equal size JK regions, and we 
tested that the results that we now present do not depend on variations 
in this set-up (we have explored njx from 20 to 100). In Eig.|18|we 
show how these two estimates compare to each other. On large-scales 
(0 > 0.1 deg at low 2 , and 6 > 0.01 deg at high 2 ) they match each 
other well, except for the 0.8 < 2 < 1.0 where the theory yields 
~ 20% — 25% higher errors in the range 0.1 deg < 6 < 1 deg. 
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Photo- 2 : Bin 

Jack-knife Cov. 

Theory Cov. 

Mix Cov. 

0.2 < 

2 < 0.4 

1.05 ±0.07 

1.04 ± 0.08 

1.05 ± 0.07 

0.4 < 

2 < 0.6 

1.22 ±0.04 

1.23 ±0.06 

1.23 ± 0.05 

0.6 < 

2 < 0.8 

1.37 ± 0.03 

1.36 ±0.04 

1.36 ± 0.04 

0.8 < 

2 < 1.0 

1.55 ±0.02 

1.54 ± 0.04 

1.54 ± 0.02 

1.0 < 

2 < 1.2 

2.16 ±0.12 

2.16 ±0.12 

2.17 ± 0.09 


Table 6. Dependence of best-fit bias on different methods to estimate the co- 
variance matrix (see text for details). The bias is obtained from fitting the auto¬ 
correlations in Fig. |ll| over a linear range of scales This paper uses the mixed 
approach for its main results. 


A second test is to derive the best-fit bias using three possi¬ 
ble methods for constructing the covariance matrix, a full theory ap¬ 
proach, the JK method, or the combined one that we use in Sec. |2.3| 
This is shown in Table The best-fit biases are not affected by the 
choice of covariance, while the la errors vary, but not so much as to 
affect any of our conclusions. At smaller scales, the JK uncertainties 
are significantly larger than the theoretical ones; this is likely due to 
both non-linear and small-scale mask effects. 


C Galaxy Bias in CFHTLS 


Galaxies in [Coupon et al.| ( [2012^ are first selected according to 
SExtractor MAG_AUTO magnitudes of i < 22.5, similar to the 
sample presented in this paper. Their flux limited sample was then 
split in several volume limited absolute magnitude "threshold” sam¬ 
ples in each photo-a bin. Therefore in each redshift bin, our sample 
corresponds to the faintest absolute magnitude sample in jCoupon et al.j 
^2012) , except for the fact that our sample is not volume limited. 

For example over the redshift range 0.4 < 2 < 0.6 the faintest 
volume limited sample is defined as brighter than Mr — 5 log h = 
— 18.8 because this is the absolute magnitude corresponding to ap¬ 
parent magnitude i < 22.5 for the farthest galaxies a.1 z — 0.6. At 
2 = 0.4 the absolute magnitude corresponding to i < 22.5 is one 
magnitude fainter, Mr — 51og/i = —17.8. Therefore as one spans 
the whole redshift range of the sample from 2 = 0.4 to 2 = 0.6 there 
is a set of galaxies brighter than Mr — 5 log h = —17.8 but fainter 
than Mr — 51og/i = —18.8, that do not enter the volume limited 
sample in 0.4 < 2 < 0.6 but do enter the flux limited one. And this 
holds for all photo -2 bins: there is always one magnitude difference 
between the faintest absolute magnitude from the lowest to the highest 
redshift of each photo -2 bin ( [Cou p on et al.|201^ . 

To directly compare our bias measurements to those in [Coupon] 
[et al.[ ( |20T^ we need to know how bias depends on luminosity to then 
estimate the contribution to the bias of this “extra” set of galaxies, as 


/■ 


^flux lim, bin N — ^vollim('\ M^r T 


(32) 


where Mr is the magnitude threshold defining the faintest sample for 
the photo -2 bin N. Here a; = 1 would correspond to the lowest redshift 
bound of such bin, while x = 0 the highest. 

At low redshift the difference discussed above does not impact 
the derived bias strongly because these galaxies are faint enough that 
have biases close to one and are in a regime of very weak dependence 
with luminosity. However as we move to higher redshifts, the faintest 
samples move to Mr — 5 logj^Q h < —21 where bias evolves strongly 
with luminosity (e.g. see Fig. 18 in [Coupon et al.[ ( |2012^ ). 

To account for this effect so as to properly compare our bias re¬ 
sults to those of CFHTLS we did as follows. We collected all the 
best-fit biases reported by [Coupon et al.| ( [2012^ in their Table B.l. 
This is shown in Fig. [T^ We have found that these measurements 
are very well fit by a simple linear relation with r-band luminosity 



9 [deg] 

Figure 18. Comparison of Jackknife resampling errors (points) and theoreti¬ 
cally derived ones (lines) for our five tomographic bins (BPZ photo- 2 ). For the 
former we used 40 equal size regions. The theory estimation uses the best-fit 
bias, the projected number density and the area of the footprint (and its based 
on the nonlinear matter power spectra). The agreement is very good on scales 
(6 > 1 deg at low -2 and 0 > 0.1 deg at high- 2 ), while on smaller scales the 
structure of the data as captured by the JK resampling yields larger variance. 


b{Mr) = bo{z) -F aoLr with constant oo = 0.13/Z/J~°'^. This 
result is in fact very similar to the one discussed by [Coupon et al.[ 
\2t)\2) , although we do not normalize by the characteristic luminos¬ 
ity at the given redshift. We then fit 6 o at every photo -2 bin, includ¬ 
ing 1.0 < 2 < 1.2 where only one data-point is given by [Coupon] 
et^([M^. We find bo = (1.08,1.18,1.25,1.35,1.4) at photo -2 


bins centered at (0.3, 0.5, 0.7, 0.9,1.1). These results are displayed 
in Fig.fT^ 

Once provided with a fit for 6 (< Mr) we computed the bias 
conversion as in Eq. l [32[ (. Results are detailed in Table [^ T he fourth 
column displays the values that we use in Sec.j^and FigTfl 1 [ Because 
the effect is not strong and the bias range not large, the results from an 
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Figure 19. Fit to the bias as a function of luminosity presented by Coupon et 
al. (2012) for the 5 bins also used throughout our paper (starting from 0.2 < 
z < 0.4 at the bottom). The evolution can be simply described as depending 
linearly with luminosity (see inset label). 


Photo- 2 : Bin 

Mr — 5 log h 

^vol lim(^ Mr) 

i>fluxlim(< Mr) 

0.2 < z < 0.4 

-17.8 

1.08 ±0.01 

1.09 

0.4 < z < 0.6 

-18.8 

1.23 ±0.01 

1.22 

0.6 < z < 0.8 

-19.8 

1.40 ±0.01 

1.35 

0.8 < z < 1.0 

-20.8 

1.74 ±0.03 

1.59 

1.0 < z < 1.2 

-21.8 

C) QQ“l"U.Ub 

z.OO-o 06 

2.00 


Table 7. Translation of the biases defined by Coupon et al. (2012) where sam¬ 
ples are defined in terms of volume limited quantities to our case where we 
have a flux-limited sample across the full redshift range (the parent catalog in 
both cases is similar, i — bandMAG_AUT0 < 22.5). Here Mr is the absolute 
magnitude corresponding to the faintest sample in each photo- 2 : bin as defined 
by Coupon et al. (2012), bvoiiim values reported in their Table 

B.I., and bfluxiim fbe ones directly comparable to our results. 


and 


(No) = il + fst){Ngai). (35) 

Given the over-density 

5 = N/{N) - 1, (36) 

we have 

^ 1 1 N.tF{S) _ 

° 1 + fst {N,al) 1 + fst {N,al) ’ ^ ^ 

which can be re-written in terms of the galaxy over-density 
e _ -fst ^ 1 NstF{S) 

1 + fst ^ 1 + fst (N.rrl) ■ ^ ' 

and then the over-density of the stellar contamination, 5st,s = 


NstFjS) 

{NstF(S)) 


- 1 


J- Sgal fst I /. /r , 1 N ^gal I /• r , fst 

Oo = -h Jst(Ost,S + f) = T~r~e - 1" /stOat.S + 


1 + fs, 


1 + fs, 


r ■ ■ , (JV,* gF(S)) , 

after recognizing that = /«*• 


1 + fst’ 

(39) 


The measured angular correlation function, Wo = (SoSo), is 

ri 


Wo = 


'^gal 


+ fstWst,S + 


(40) 


(l + /.t)" ' ' (l + fstV’ 

(assuming no true correlation between galaxies and stars). Therefore 


Wgal = (1 + fst) Wo - fstWst,S - 


ftt 


(1 + fst)^ 


(41) 


The above requires no approximations, but is limited to the spe¬ 
cific case of contamination. One must be able to estimate Wst.s. but 
we expect this can be approximated as Wst in many cases. Further, es¬ 
timating the stellar contamination as a function of survey conditions 
can be done empirically and using forward-modeling tools (the po¬ 
tential exists when using similar tools as, e.g.,| Leistedt et al!| ( |2015^ ; 
ISuchyta et al.|p0T5) ). 


This paper has been typeset from a TgX/ KTgX file prepared by the 
author. 


integration as in Eq. l |32| l are equivalent to simply computing the bias 
for samples shifted half by a magnitude (i.e. bfluxiim(< -17.8-1-0.5) 
for 0.2 < a < 0.4, then fofluxiim(< —18.8 -I- 0.5) for 0.4 < 2 < 0.6 
and so on). 


D Stellar Contamination 

Building from the results of, e.g., [Myers et al.| ( |2006) ; [Ross et 
( |201 la[ >; |Ho et al.] \2Q\2) , we derive the effect that stellar contami¬ 
nation has on the measured clustering of a galaxy sample. We assume 
the observed number of galaxies in some location. No, is a function 
of the true number of galaxies, the number of stars, Nst, that could be 
mis-classified as a galaxy, and some function of the survey conditions, 
S. If the only source that modulates the observed number of galaxies 
is stellar contamination, then 


No = Ngoi + NstF{S), (33) 


where NstF{S) is the number of stars mis-classified as galaxies at a 
particular location. The fractional stellar contamination, fst, over the 
footprint is 


{NstfjS)) 

(No) 


(34) 


AFFILIATIONS 

^Institut de Ciencies de I’Espai, lEEC-CSIC, Campus UAB, Carrer 
de Can Magrans, s/n, 08193 Bellaterra, Barcelona, Spain 
^Institut de Fisica d’Altes Energies, Universitat Autdnoma de 
Barcelona, E-08193 Bellaterra, Barcelona, Spain 
^ Center for Cosmology and Astro-Particle Physics, The Ohio State 
University, Columbus, OH 43210, USA 

‘^Centro de Investigaciones Energeticas, Medioambientales y Tec- 
noldgicas (CIEMAT), Madrid, Spain 

^Department of Astronomy, University of Illinois, 1002 W. Green 
Street, Urbana, IL 61801, USA 

^Kavli Institute for Cosmology Cambridge and Institute of Astronomy, 
University of Cambridge, Madingley Road, Cambridge CB3 OHA, UK 
^Fermi National Accelerator Laboratory, P. O. Box 500, Batavia, IL 
60510, USA 

^Laboratorio Interinstitucional de e-Astronomia - LIneA, Rua Gal. 
Jose Cristina 77, Rio de Janeiro, RJ - 20921-400, Brazil 
^Department of Physics & Astronomy, University College London, 
Gower Street, London, WCIE 6BT, UK 
National Center for Supercomputing Applications, 1205 West Clark 
St., Urbana, IL 61801, USA 

Observatorio Nacional, Rua Gal. Jose Cristina 77, Rio de Janeiro, 

© 0000 RAS, MNRAS 000,[fH23] 


































Galaxy clustering, photo-z and systematics in DES-SV 23 


RJ - 20921-400, Brazil 

^^Kavli Institute for Cosmological Physics, University of Chicago, 
Chicago, IL 60637, USA 

Department of Physics, ETH Zurich, Wolfgang-Pauli-Strasse 16, 
CH-8093 Zurich, Switzerland 

Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berke¬ 
ley, CA 94720, USA 

Institute of Cosmology & Gravitation, University of Portsmouth, 
Portsmouth, POl 3FX, UK 

^^Instituto de Fi'sica Tedrica and ICTP SAIFR, Universidade Estad- 
ual Paulista, Rua Dr. Bento T. Ferraz 271, Sao Paulo, SP 01140-070, 
Brazil 

Kavli Institute for Particle Astrophysics & Cosmology, P. O. Box 
2450, Stanford University, Stanford, CA 94305, USA 
^^SLAC National Accelerator Laboratory, Menlo Park, CA 94025, 
USA 

^^Cerro Tololo Inter-American Observatory, National Optical Astron¬ 
omy Observatory, Casilla 603, La Serena, Chile 
^^Kavli Institute for Cosmology, University of Cambridge, Madingley 
Road, Cambridge CB3 OHA, UK 

Department of Physics and Astronomy, University of Pennsylvania, 
Philadelphia, PA 19104, USA 

^^CNRS, UMR 7095, Institut d’Astrophysique de Paris, F-75014, 
Paris, France 

Sorbonne Universites, UPMC Univ Paris 06, UMR 7095, Institut 
d’Astrophysique de Paris, F-75014, Paris, France 
Excellence Cluster Universe, Boltzmannstr. 2, 85748 Garching, 
Germany 

Faculty of Physics, Ludwig-Maximilians University, Scheinerstr. 1, 
81679 Munich, Germany 

^^Jet Propulsion Laboratory, California Institute of Technology, 4800 
Oak Grove Dr., Pasadena, CA 91109, USA 
Department of Astronomy, University of Michigan, Ann Arbor, MI 
48109, USA 

Department of Physics, University of Michigan, Ann Arbor, MI 
48109, USA 

^^Max Planck Institute for Extraterrestrial Physics, Giessenbach- 
strasse, 85748 Garching, Germany 

Universitdts-Sternwarte, Fakultdt fur Physik, Ludwig-Maximilians 
Universitdt Munchen, Scheinerstr. 1, 81679 Munchen, Germany 
Department of Physics, The Ohio State University, Columbus, OH 
43210, USA 

Australian Astronomical Observatory, North Ryde, NSW 2113, Aus¬ 
tralia 

George P. and Cynthia Woods Mitchell Institute for Fundamental 
Physics and Astronomy, and Department of Physics and Astronomy, 
Texas A&M University, College Station, TX 77843, USA 
^'^Departamento de Fi'sica Matemdtica, Institute de Ftsica, Universi¬ 
dade de Sao Paulo, CP 66318, CEP 05314-970, Sao Paulo, SP, Brazil 
Department of Astronomy, The Ohio State University, Columbus, 
OH 43210, USA 

Department of Physics and Astronomy, Pevensey Building, Univer¬ 
sity of Sussex, Brighton, BNl 9QH, UK 
Institute de Ftsica, UFRGS, Caixa Postal 15051, Porto Alegre, RS 
- 91501-970, Brazil 

Department of Physics, University of Illinois, 1110 W. Green St., 
Urbana, IL 61801, USA 

^^Argonne National Laboratory, 9700 South Cass Avenue, Lemont, IL 
60439, USA 

Department of Physics, Stanford University, 382 Via Pueblo Mall, 
Stanford, CA 94305, USA 

*^Jodrell Bank Center for Astrophysics, School of Physics and Astron¬ 
omy, University of Manchester, Oxford Road, Manchester, Ml 3 9PL, 
UK 


Centre for Theoretical Cosmology, DAMTP, University of Cam¬ 
bridge, Wilberforce Road, Cambridge CB3 OWA, United Kingdom 
Department of Physics and Electronics, Rhodes University, PO Box 
94, Grahamstown, 6140, South Africa 


© 0000 RAS, MNRAS 000, [TH^ 


